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INTRODUCTION 


Proliferation  of  mammary  and  uterine  tissues  is  governed  in  large  part  by  the  steroid  hormones  (Beato  et 
al.,  1995),  which  trigger  activation  of  genes  that  regulate  the  cell  cycle  (Kamei  et  al.,  1996;  Sicinski  et  al., 

1995) .  Hormone  binding  stimulates  receptor  binding  to  response  elements  upstream  of  target  genes. 

Gene  activation  by  the  steroid  hormone  receptors  requires  several  cofactors,  including  the  CREB  binding 
protein  (CBP)  or  the  highly  related  p300  protein,  as  well  as  SRC-1,  PCAF,  and  ACTR  (Chen  et  al.,  1997; 
Kamei  et  al.,  1996;  Onate  et  al.,  1995;  Spencer  et  al.,  1997;  Yang  et  al.,  1996).  Strikingly,  all  of  these 
cofactors  possess  histone  acetyltransferase  activity.  The  histones  fold  DNA  into  chromatin  and  this 
folding  is  generally  inhibitory  to  transcription.  Post-translational  acetylation  of  the  histones  modulates 
chromatin  folding  and  is  a  key  step  in  the  regulation  of  gene  expression  (Turner,  1991;  Wolffe  and  Pruss, 

1996) .  Histone  acetylation  levels  are  governed  by  the  opposing  actions  of  two  groups  of  enzymes,  the 
histone  acetyltransferases  (HATs)  and  the  histone  deacetylases  (HDACs).  Experiments  supported  by  this 
IDEA  grant  were  focused  on  understanding  the  role  of  the  PCAF  histone  acetyltransferase  in  steroid 
hormone  responses  in  normal  mammary  development  and  in  abnormal  events  that  lead  to  cancer.  Our 
original  Specific  Aims  were  to  1)  complete  the  isolation  and  cloning  of  mouse  PCAF  cDNA  and  genomic 
sequences  2)  to  characterize  murine  PCAF  as  a  histone  acetyltransferase  and  3)  examine  PCAF 
interactions  with  specific  steroid  receptors  and  CBP/p300  4)  to  examine  whether  acetylase  activity  is 
required  for  transcriptional  activation  by  the  hormone  receptors  5)  to  create  mice  deficient  for  PCAF  and 
mice  that  overexpress  this  factor  to  assess  the  role  of  PCAF  in  mouse  development,  differentiation,  and 
tumorigenesis. 


BODY 


Specific  Aim  1:  to  complete  the  isolation  and  cloning  of  mouse  PCAF  cDNA  and  genomic 
sequences.  As  reported  previously,  this  specific  aim  was  completed  and  the  cloning  of  the  mouse  PCAF 
cDNA  and  genomic  sequences  was  reported  in  a  paper  we  published  in  Molecular  and  Cellular  Biology 
(Xu  et  al,  1998;  attached  in  appendix;  see  Figs.  1,  2,  and  3).  This  paper  also  reports  the  cloning  of  a 
highly  related  gene  encoding  the  GCN5  histone  acetyltransferase  and  our  finding  that  GCN5  is 
genetically  linked  to  the  BRCA1  locus. 

Specific  Aim  2:  to  characterize  murine  PCAF  as  a  histone  acetyltransferase.  This  specific  aim  was 
also  completed,  as  reported  previously.  The  results  are  depicted  in  Figs.  6  and  7  of  Xu  et  al,  1998 
(attached  in  appendix). 

Specific  Aim  3:  to  examine  PCAF  interactions  with  specific  steroid  receptors  and  CBP/p300.  As 

reported  last  year,  we  and  others  have  described  the  binding  of  PCAF  to  CBP/p300  ( Xu  et  al,  1998  and 
(Yang  et  al.,  1996)).  Tissue  culture  experiments  have  been  performed  by  others  to  examine  the 
importance  of  PCAF  to  hormone  responses  (Chen  et  al.,  1997),  and  they  have  found  that  PCAF  augments 
estrogen  responses.  Our  basic  premise,  then,  that  PCAF  is  important  for  these  processes  is  supported  by 
these  cell  culture  results.  However,  our  results  below  with  the  PCAF  knock  out  mice  indicate  that  PCAF 
is  not  required  for  these  processes  in  whole  animals.  Therefore,  we  have  completed  all  the  relevant 
experiments  in  this  aim  and  PCAF  interactions  with  the  hormone  receptors  will  not  be  pursued  further. 

Specific  Aim  4:  to  examine  whether  PCAF  acetylase  activity  is  required  for  transcriptional 
activation  by  the  hormone  receptors.  As  reported  last  year,  these  experiments  were  superceded  by  our 
findings  with  the  PCAF  knockout  mice,  which  indicate  PCAF  is  not  needed  for  hormone  responses  in 
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mice.  We  proposed  last  year  to  modify  this  aim  to  make  ‘acetylase  minus’  forms  of  GCN5  for  gene 
replacement  studies  in  mice.  We  have  now  completed  the  cloning  of  these  mutations  and  are  now 
assembling  them  into  our  gene  replacement  vector. 

Specific  Aim  5:  to  create  mice  deficient  for  PCAF  and  mice  that  overexpress  this  factor  to  assess  the 
role  of  PCAF  in  mouse  development,  differentiation,  and  tumorigenesis.  As  reported  last  year,  the 
first  half  of  this  aim,  to  create  PCAF  deficient  mice,  is  completed.  Since  PCAF  and  GCN5  are  very 
highly  related,  we  also  created  mice  deficient  for  GCN5.  Our  creation  and  analysis  of  these  mice  is 
reported  in  a  manuscript  recently  accepted  for  publication  in  Nature  Genetics  (Xu  et  al,  2000;  manuscript 
attached  in  appendix).  We  found  that  loss  of  PCAF  had  no  adverse  effects  on  mouse  development  (see 
Fig.  1  in  Xu  et  al,  2000)  or  in  adult  mice.  Estrogen-regulated  processes  such  as  pregnancy  and  lactation 
were  also  unaffected.  In  contrast,  loss  of  the  highly  related  GCN5  protein  causes  embryonic  lethality  (see 
Fig.  2,  Xu  et  al,  2000).  We  determined  that  this  lethality  is  due  to  an  increase  in  programmed  cell  death 
of  particular  mesodermal  lineages.  These  exciting  results  suggest  that  GCN5  regulates  apoptosis  and  thus, 
is  likely  be  important  in  regulation  of  tumor  formation  or  growth. 

To  relate  the  above  accomplishments  to  our  Statement  of  Work: 

•  year  1:  All  goals  were  completed  except  for  the  last  one,  to  clone  an  MMTV-PCAF  transgene.  This 
goal  became  when  we  determined  that  our  PCAF  deficient  mice  exhibit  no  abnormal  phenotype. 

•  year  2:  All  goals  were  completed,  except  for  the  last  one,  which  again  involves  the  MMTV-PCAF 
transgene. 

•  year  3:  Goal  1  is  still  in  progress;  we  are  now  cloning  ‘knock  in  vectors’  for  PCAF  and  GCN5  that 
will  contain  either  wild  type  or  ‘acetylase  minus’  versions  of  these  genes.  Goal  2  is  also  completed. 
We  monitored  the  PCAF  knock  out  mice  for  spontaneous  tumors  for  more  than  2  years  but  saw  no 
increased  incidence  or  rate  of  tumor  formation.  Goal  3,  to  examine  mammary  tissue  in  the  PCAF 
knock  out  mice,  was  completed  and  reported  previously.  We  are  now  cloning  of  a  breast-specific 
knock  out  vector  for  GCN5  (revised  goal  4).  When  these  mice  are  created,  we  will  examine  the 
expression  and  functions  of  the  steroid  receptors  (goal5).  These  remaining  experiments  are  now 
supported  by  a  new  grant  from  the  USAMRC  (see  below). 


KEY  RESEARCH  ACCOMPLISHMENTS 


Reported  in  Xu  et  al,  1998: 

•  Isolation  and  cloning  of  the  murine  PCAF  and  GCN5  genes. 

•  Determination  that  both  PCAF  and  GCN5  interact  with  CBP/p300 

•  Determination  that  GCN5  exists  in  a  long  and  short  form  in  mammalian  cells 

•  Determination  that  GCN5  and  PCAF  have  identical  substrate  specificities 

•  Determination  that  GCN5  and  PCAF  are  expressed  in  complementary  patterns  in  adult  mice 

•  Determination  that  GCN5  is  genetically  linked  to  the  BRCA1  locus 

Reported  in  Xu  et  al,  2000 

•  Determination  that  PCAF  is  dispensable  in  mice 

•  Determination  that  GCN5  is  essential  for  mouse  development 

•  Determination  that  double  mutations  in  GCN5  and  PCAF  cause  more  severe  defects  in  mouse 
development  than  does  loss  of  either  gene  alone,  indicating  that  both  HATs  contribute  to  normal 
embryogenesis. 

•  Determination  that  specific  mesodermal  lineages  are  missing  in  the  GCN5  mutant  mice 
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•  Determination  that  loss  of  mesodermal  lineages  in  the  absence  of  GCN5  is  due  to  a  25  fold  increase 
in  apoptosis  in  these  tissues. 


REPORTABLE  OUTCOMES 


Manuscripts: 

1) .  Xu,  W.,  Edmondson,  D.  G.,  and  Roth,  S.Y.  (1998).  Mammalian  GCN5  and  PCAF  acetyltransferases 
have  homologous  amino  terminal  domains  important  for  recognition  of  nucleosomal  substrates. 
Molecular  and  Cellular  Biology  18,  5659-5669 

2)  Xu,  W.,  Edmondson,  D.G.,  Evrard,  Y.A.,  Wakamiya,  M.,  Behringer,  R.R.,  and  Roth,  S.Y.  (2000). 
Loss  of  Gcn5  leads  to  increased  apoptosis  and  mesodermal  defects  during  mouse  development.  Nature 
Genetics,  in  press. 

Abstracts: 


Evrard,  Y.  Xu,  W.,  Edmondson,  D.G.,  Wakamiya,  M.,  Behringer,  R.R.,  and  Roth,  S.Y.  Role  of 
GCN5  in  apoptosis,  tumor  suppression  and  breast  development;  presented  at  the  DOD,  USAMRMC 
Era  of  Hope  Meeting,  Atlanta,  Georgia,  June  8-12,  2000. 

Evrard,  Y.  Xu,  W.,  Edmondson,  D.G.,  Wakamiya,  M.,  Behringer,  R.R.,  and  Roth,  S.Y.  Analysis 
of  mice  null  for  two  highly  related  histone  acetyltransferases;  presented  at  the  Southwest 
Regional  Society  for  Developmental  Biology  Meeting,  Houston,  Texas,  March  10-13,  2000 


Degrees  obtained: 

The  Ph.D.  thesis  research  of  my  former  student,  Wanting  Xu,  was  supported  largely  by  this  grant, 
although  she  had  a  private  fellowship  that  paid  her  stipend.  Dr.  Xu  completed  her  Ph.D.  in  May,  1999. 
Another  Ph.D.  student,  Anjanette  Watson,  was  also  supported  partially  by  this  grant.  Anjie  completed  her 
Ph.D.  in  May,  2000. 

Funding  obtained: 

Based  in  part  on  the  results  reported  here,  I  was  awarded  another  IDEA  grant  entitled  “Role  of  GCN5  in 
estrogen  response,  tumor  suppression,  and  breast  development  in  mice’  (BC990781),  which  began  in  June 
2000.  I  have  also  received  funding  to  study  the  importance  of  GCN5  in  regulating  apoptosis  during 
normal  fetal  development  in  mice  from  the  March  of  Dimes  Foundation  (FY00-176). 


CONCLUSIONS 

We  discovered  that  the  PCAF  gene  is  very  highly  related  in  structure  and  in  function  to  the  GCN5  gene. 
However,  loss  of  these  two  genes  has  very  different  consequences  in  the  mouse.  PCAF  is  apparently 
dispensable.  This  result  is  especially  surprising  because  a  number  of  studies  in  tissue  culture  cells 
indicate  that  PCAF  is  important  to  hormone  responses,  including  estrogen  responses,  and  to  muscle  cell 
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differentiation.  The  absence  of  abnormalities  in  hormone  dependent  processes  in  PCAF  deficient  mice 
indicates  either  that  PCAF  is  not  important  for  these  processes  or  that  redundant  activities,  like  GCN5, 

‘fill  in’  when  PCAF  is  missing.  In  contrast,  loss  of  GCN5  results  in  embryonic  death  due  to  an  increase  in 
apoptosis.  This  finding  further  supports  the  idea  that  GCN5  and  PCAF  may  be  important  for  regulation  of 
tumorigenesis,  since  programmed  cell  death  is  often  altered  in  tumor  cells.  We  have  also  found  that 
GCN5  is  genetically  linked  to  BRCA1  in  mouse.  Others  have  found  this  same  linkage  in  humans.  This 
region  of  the  human  genome  often  exhibits  loss  of  heterozygosity  in  spontaneous  breast  and  ovarian 
cancers,  which  do  not  exhibit  mutations  in  BRCA1  or  BRCA2.  We  are  now  investigating  how  GCN5 
regulates  apoptosis,  how  this  regulation  might  be  tied  to  tumorigenesis,  and  how,  or  if,  GCN5  is 
important  to  mammary  development.  These  experiments  are  important  for  discerning  how  histone 
acetyltransferases  contribute  to  hormone  responses  and  apoptosis,  and  thus  may  yield  invaluable 
information  for  the  prevention  or  treatment  of  hormone  responsive  breast  cancers. 
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The  yeast  transcriptional  adapter  Gcn5p  serves  as  a  histone  acetyltransferase,  directly  linking  chromatin 
modification  to  transcriptional  regulation.  Two  human  homologs  of  Gcn5p  have  been  reported  previously, 
hsGCN 5  and  hsP/CAF  (p300/CREB  binding  protein  [CBP] -associated  factor).  While  hsGCNS  was  predicted  to 
be  close  to  the  size  of  the  yeast  acetyltransferase,  hsP/CAF  contained  an  additional  356  amino-terminal 
residues  of  unknown  function.  Surprisingly,  we  have  found  that  in  mouse,  both  the  GCN5  and  the  PICAF  genes 
encode  proteins  containing  this  extended  amino-terminal  domain.  Moreover,  while  a  shorter  version  of  GCN5 
might  be  generated  upon  alternative  or  incomplete  splicing  of  a  longer  transcript,  mRNAs  encoding  the  longer 
protein  are  much  more  prevalent  in  both  mouse  and  human  cells,  and  larger  proteins  are  detected  by 
GCN5-specific  antisera  in  both  mouse  and  human  cell  extracts.  Mouse  GCN5  (mmGCNS)  and  mmPjCAF  genes 
are  ubiquitously  expressed,  but  maximum  expression  levels  are  found  in  different,  complementary  sets  of 
tissues.  Both  mmP/CAF  and  mmGCN5  interact  with  CBP/p300.  Interestingly,  mmGCN5  maps  to  chromosome 
11  and  cosegregates  with  BRCA1 ,  and  mmPjCAF  maps  to  a  central  region  of  chromosome  17.  As  expected, 
recombinant  mmGCN5  and  mmP/CAF  both  exhibit  histone  acetyltransferase  activity  in  vitro  with  similar 
substrate  specificities.  However,  in  contrast  to  yeast  Gcn5p  and  the  previously  reported  shorter  form  of 
hsGCNS,  mmGCN5  readily  acetylates  nucleosomal  substrates  as  well  as  free  core  histones.  Thus,  the  unique 
amino-terminal  domains  of  mammalian  P/CAF  and  GCN5  may  provide  additional  functions  important  to 
recognition  of  chromatin  substrates  and  the  regulation  of  gene  expression. 


Transcription  is  a  complex  process  requiring  the  coordinate 
action  of  multiple  basal  and  transactivating  proteins.  In  eu¬ 
karyotic  cells,  this  process  is  complicated  further  by  the  pack¬ 
aging  of  DNA  into  chromatin.  Nucleosomes  provide  the  fun¬ 
damental  repeat  unit  of  chromatin,  consisting  of  two  molecules 
of  each  of  the  four  core  histones  (H2A,  H2B,  H3,  and  H4)  and 
~146  bp  of  DNA  wound  in  almost  two  turns  around  the  exte¬ 
rior  of  the  histone  octamer  (37).  Individual  nucleosomes  as 
well  as  more  highly  folded  structures  are  generally  inhibitory  to 
the  initiation  of  transcription.  Alterations  in  nucleosomal 
structure  and  in  chromatin  packing  often  accompany  transcrip¬ 
tional  activation  (12). 

Posttranslational  acetylation  of  the  histones  has  long  been 
correlated  with  transcriptional  activation  (36,  39,  40).  Acetyla¬ 
tion  neutralizes  the  charge  associated  with  epsilon  amino 
groups  of  lysine  residues,  thereby  loosening  contacts  between 
the  histones  and  the  negatively  charged  DNA.  Histone  acety¬ 
lation  also  influences  compaction  of  nucleosomal  arrays,  yield¬ 
ing  less  condensed  chromatin  structures  (16).  Both  of  these 
effects  can  increase  transactivator  binding  to  nucleosomal 
DNA,  facilitating  transcriptional  activation. 

A  molecular  basis  for  the  linkage  between  histone  acetyla¬ 
tion  and  gene  activation  was  provided  by  the  discovery  that  the 
yeast  transcriptional  adapter  Gcn5p  serves  as  the  catalytic  sub¬ 
unit  of  a  histone  acetyltransferase  type  A  activity  (5).  Gcn5p  is 
associated  with  two  multisubunit  complexes  in  yeast,  which 
include  Ada  proteins  (Ada2p,  Ada3p,  and  Ada5p)  and/or  cer- 
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tain  Spt  proteins  (6,  14,  18,  24,  25,  31).  These  complexes  are 
required  for  transcriptional  activation  by  particular  transacti¬ 
vators,  including  heterologous  VP16  derivatives  and  endoge¬ 
nous  Gcn4p  (3,  13,  24,  34).  Components  of  the  Gcn5p-Adap 
complex  contact  both  transactivator  proteins  and  basal  tran¬ 
scription  proteins,  thus  providing  an  adapter  or  coactivator 
function,  in  addition  to  histone  acetyltransferase  activity  (2, 
34).  Association  with  both  Ada2p  and  acetyltransferase  activity 
is  required  for  Gcn5p  function  in  vivo  (8,  38). 

Human  homologs  of  GCN5  have  been  cloned  based  on 
sequence  and  functional  similarities  of  their  predicted  prod¬ 
ucts  to  the  yeast  protein.  A  cDNA  predicted  to  encode  a 
protein  of  similar  size  and  with  overall  homology  to  yeast 
Gcn5p  has  been  described  (7,  41).  A  human  ADA2  gene  has 
also  been  cloned,  indicating  a  conservation  of  adapter  and 
histone  acetyltransferase  functions  across  species  (7).  In  addi¬ 
tion,  a  cDNA  encoding  a  second,  larger  Gcn5-related  protein 
that  possesses  unique  sequences  in  its  amino-terminal  half  has 
been  identified.  This  protein,  P/CAF  (p300/CREB  binding 
protein  [CBP] -associated  factor),  associates  with  two  highly 
related  proteins,  p300  and  CBP,  that  have  a  region  of  homol¬ 
ogy  with  ADA2  (41).  Interestingly,  p300  and  CBP  are  also 
histone  acetyltransferases  (1,  29).  Interactions  between  P/CAF 
and  p300  or  CBP  are  disrupted  by  the  viral  El  A  oncogene 
product,  and  this  disruption  is  required  for  cellular  transfor¬ 
mation  by  E1A  (41).  Proper  association  of  these  histone  acetyl¬ 
transferase  activities,  then,  is  extremely  important  for  normal 
cell  growth  (32). 

In  order  to  further  study  the  functions  of  histone  acetyltrans¬ 
ferases  in  the  growth  and  development  of  mammalian  cells,  we 
endeavored  to  isolate  sequences  encoding  mouse  GCN5 
(mmGCN5)  and  mmP/CAF.  To  our  surprise,  although  our 
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mmGCN5  exhibited  98%  identity  with  the  reported  human 
GCN5  (hsGCN5)  sequence,  the  mouse  cDNA  encoded  an 
extended  amino-terminal  domain  with  high  similarity  to  a  cor¬ 
responding  domain  in  P/CAF.  Upon  further  examination,  we 
found  that  the  reported  hsGCN5  cDNA  (41)  may  result  from 
an  incompletely  spliced  transcript,  and  that  a  more  prevalent 
transcript  exists  that  potentially  encodes  a  longer  hsGCN5 
protein  similar  to  that  encoded  by  the  mouse  cDNA  that  we 
isolated.  Moreover,  in  contrast  to  previous  reports  that  yeast 
and  human  GCN5  proteins  acetylate  only  free  core  histones, 
the  full-length  recombinant  mmGCN5  protein  containing  this 
extended  amino-terminal  region  acetylates  both  free  and  nu- 
cleosomal  histones  H3  and  H4.  These  results  suggest  that  this 
additional  domain  in  the  mammalian  GCN5  acetyltransferase 
facilitates  chromatin  recognition.  Interestingly,  P/CAF  and 
GCN5  are  expressed  in  inverse  ratios  in  many  mouse  tissues, 
indicating  that  these  proteins  may  serve  tissue-specific  func¬ 
tions. 

MATERIALS  AND  METHODS 

cDNA  library  screening.  Nested  PCR  with  degenerate  oligomers  and  a  mouse 
embryonic  cDNA  library  (13.5  days  postcoitum  [dpc])  as  the  template  was  per¬ 
formed  to  generate  a  fragment  of  the  mmGCN5  cDNA.  Oligomers  were  chosen 
from  regions  of  sequence  conserved  between  yeast  and  Tetrahymena,*  which 
correspond  to  amino  acids  131  to  244  of  the  yeast  protein  sequence.  A  single 
band  of  123  bp  was  generated  and  cloned  into  pBluescript  (Stratagene).  Se-. 
quencing  revealed  80%  nucleotide  identity  and  94%  identity  at  the  amino  acid 
level  to  the  reported  hsGCN5.  This  PCR  product  and  human  EST  clones 
(IMAGE  clone  no.  243927)  with  similarity  to  GCN5  were  used  together  to 
screen  a  cDNA  library  under  conditions  of  low  stringency  as  previously  described 
(11).  Clones  were  plaque  purified  and  rescued  as  per  the  manufacturer’s  proto¬ 
col.  Sequencing  revealed  two  types  of  clones,  some  with  similarity  to  hsGCN5 
and  some  with  similarity  to  hsP/CAF.  All  of  the  PICAF  clones  contained  only  a 
short  piece  of  P/CAF,  and  rescreening  of  the  library  failed  to  isolate  any  longer 
clones.  Therefore,  an  oligomer  corresponding  to  the  5'-most  sequence  of  mmPl 
CAF  was  used  to  screen  a  10.5-dpc  embryonic  mouse  plasmid  library  with 
GeneTrapper  technology  (Gibco  BRL).  Additional  clones,  corresponding  to 
full-length  P/CAF  sequences,  were  isolated  according  to  the  manufacturer’s 
protocol. 

Genomic  library  screening.  A  mouse  genomic  library,  Lambda  FIXII  (Strat¬ 
agene),  was  screened  by  using  a  mixture  of  a  5'  fragment  of  the  mmGCN5  cDNA 
and  a  5'  fragment  of  the  mmP/CAF  cDNA.  Positive  plaques  were  picked  and 
subjected  to  secondary  screening.  Phage  DNA  was  prepared  from  positive 
plaques  by  standard  procedures.  Genomic  inserts  were  released  from  phage 
DNA  by  Notl  digestion  and  subsequently  subcloned  into  Bluescript  KS(+) 
(Stratagene). 

Sequencing  analysis.  DNA  sequencing  was  performed  by  using  the  Thermo- 
Sequenase  radiolabeled  terminator  cycle  sequencing  kit  (Amersham  Life  Sci¬ 
ence).  Sequencing  amplification  conditions  were  94°C  for  30  s,  55°C  for  30  s,  and 
72°C  for  1  min  for  40  cycles.  Alternatively,  automated  sequencing  was  carried  out 
by  the  Sequencing  Core  Facility  at  the  M.  D.  Anderson  Cancer  Center. 

Sequence  alignment.  Published  sequences  were  obtained  by  searching  the 
GenBank,  PIR-Protein,  and  SWISS-PROT  databases.  Sequence  alignment  was 
carried  out  with  the  Genetics  Computer  Group  (GCG)  (Wisconsin  Package 
version  9.1;  GCG,  Madison,  Wis.)  Pileup  program.  Percent  identity  between  two 
proteins  was  calculated  by  using  the  GCG  Bestfit  program. 

Linkage  analysis  mapping.  Restriction  fragment  length  polymorphisms  for 
mmGCN5  or  mmP/CAF  in  C57BL/6J  and  SPRET/Ei  subspecies  were  deter¬ 
mined  by  using  genomic  DNA  purchased  from  the  Jackson  Laboratory.  The 
Jackson  Laboratory  interspecific  backcross  panel  (C57BL/6JEi  x  SPRET/Ei)F! 
x  SPRET/Ei,  known  as  Jackson  BSS  (33),  was  then  used  to  map  the  chromo¬ 
somal  locations  of  the  mmGCN5  and  mmP/CAF  genes.  Predigested  panels  (Bglll 
digestion  for  P/CAF  or  Xbal  digestion  for  GCN5 )  were  analyzed  by  Southern 
blotting  with  a  GCN5  or  P/CAF  intronic  probe.  Typing  results  were  processed  via 
the  Jackson  Laboratory  database  analysis  (see  http://www.jax.org/resources/doc- 
uments/cmdata  for  raw  data). 

RT-PCR.  Isolation  of  total  RNA  from  various  mouse  tissues  was  performed  as 
described  previously  (10).  RNA  was  digested  with  RNase  A-free  DNase  I  (Am- 
bion)  for  30  min  at  37°C.  Reverse  transcriptase  PCR  (RT-PCR)  was  performed 
with  an  RT-PCR  kit  (Perkin-Elmer)  according  to  manufacturer’s  protocols. 
Reverse  transcription  was  carried  out  at  42°C  for  15  min,  followed  by  heating  at 
95°C  for  5  min.  PCRs  were  carried  out  at  95°C  for  60  s  and  60°C  for  60  s  for  35 
cycles  as  suggested  by  the  manufacturer.  Primer  A  (see  Fig.  3B  for  sequence 
location)  for  RT-PCR  is  CTGGTGCCTGAGAAGAGGAC;  primer  B  (see  Fig. 
3B)  is  CTCCGAAGGTGGCATGGTGAAG. 

RNA  analysis.  Total  RNA  from  adult  mouse  tissues  or  whole  embryos  (13.5 
dpc)  was  extracted  as  described  previously  (10).  RNAs  were  electrophoresed  on 


a  1.1%  agarose  gel  containing  formaldehyde,  along  with  RNA  molecular  size 
markers  (Gibco  BRL).  RNA  was  transferred  to  a  GeneScreen  Plus  membrane 
(NEN  Life  Science)  in  10X  SSC  (IX  SSC  is  0.15  M  NaCl  plus  0.015  M  sodium 
citrate).  Hybridization  was  carried  out  with  mmGCN5-  and  mmP/CAF- specific 
probes. 

GCN5  protein  analysis.  Mouse  embryos  (12.5  dpc)  were  homogenized  in 
radioimmunoprecipitation  assay  buffer  (IX  phosphate-buffered  saline,  1%  Non- 
idet  P-40, 0.5%  sodium  deoxycholate,  0.1%  sodium  dodecyl  sulfate  [SDS],  100  p.g 
of  phenylmethylsulfonyl  fluoride  per  ml,  1  |xg  of  aprotinin  per  ml)  and  then 
centrifuged  at  15,000  x  g  for  20  min  at  4°C.  Supernatant  was  collected  for 
Western  blotting,  and  GCN5  was  immunoprecipitated  with  the  polyclonal  hs- 
GCN5  antibody  (generously  provided  by  Shelley  Berger,  Wistar  Institute)  by  the 
protocol  of  Santa  Cruz  Biotech,  Inc.  HeLa  cell  nuclear  extract  was  kindly  pro¬ 
vided  by  Warren  Liao  and  Yongsheng  Ren  (M.D.  Anderson  Cancer  Center). 

Cloning  and  expression  of  full-length  mmGCNS  and  mmP/CAF .  Comparison 
of  the  mmGCNS  genomic  and  cDNA  clones  revealed  that  the  isolated  cDNA 
lacks  the  sequences  encoding  the  first  74  amino  acids.  These  sequences  (which 
lack  introns)  were  excised  from  the  GCN5  genomic  clone  by  Ncol  and  RreHII 
digestion  and  inserted  into  the  appropriate  position  of  the  cDNA  clone  to 
generate  a  full-length  mmGCNS  cDNA,  as  verified  by  DNA  sequencing.  Full- 
length  mmGCN5  was  subcloned  into  the  Ncol  and  Hindlll  sites  of  the  pRSETB 
vectors  (Invitrogen),  such  that  an  N-terminal  His6  tag  was  fused  in  frame  with  the 
coding  region.  Similarly,  full-length  mmP/CAF  was  subcloned  into  the  BamHl 
and  Kpnl  sites  of  the  pRSETB  vector.  His6-tagged  proteins  were  induced  in 
BL21-DE3  bacterial  cells  by  addition  of  1  mM  IPTG  (isopropyl- [3-D-thiogalac- 
topyranoside).  Recombinant  protein  was  purified  by  using  nickel-nitrilotri acetic 
acid  resin  (Qiagen)  according  to  the  manufacturer’s  protocol.  Purified  recombi¬ 
nant  proteins  were  verified  by  Western  blot  analysis  with  an  antibody  specific  to 
the  His6  tag  (Clontech). 

Acetyltransferase  assays.  Acetyltransferase  assays  were  performed  as  previ¬ 
ously  described  (4,  5).  HeLa  cell  mononucleosomes  or  core  histones  were  the 
kind  gift  of  Jerry  Workman,  and  the  cysteine-linked  peptides  (corresponding  to 
Ithiino  acids  1  to  20  of  H3  or  to  this  same  region  with  substitution  of  acetyl-lysine 
at  positions  9  and  14)  were  the  gift  of  C.  David  Allis.  Calf  thymus  histones  were 
purchased  from  Worthington  Biochemical  Corporation  (Freehold,  N.J.).  Acet¬ 
ylation  assays  were  performed  in  10-  to  30-pJ  volumes  with  either  10  p.g  of 
histones  or  the  indicated  amount  of  synthetic  peptide.  Following  incubation  at 
30°C  for  30  min,  an  aliquot  of  each  reaction  mixture  was  processed  for  liquid 
scintillation  counting  (P81  filter  assay)  as  described  by  Brownell  et  al.  (5),  and 
when  appropriate,  another  aliquot  was  electrophoresed  on  an  SDS-22%  poly¬ 
acrylamide  gel  and  histones  were  visualized  by  Coomassie  blue  staining  and 
autoradiography. 

GST  fusion  protein  interaction  assays.  Glutathione  5-transferase  (GST)- 
CBP/p300  interaction  assays  were  performed  as  described  by  Yang  et  aL  (41) 
except  that  crude  bacterial  lysates  containing  His-tagged  recombinant  P/CAF, 
GCN5,  or  HIRA  were  used  and  the  interactions  were  detected  by  Western 
blotting  with  the  His  tag  antibody. 


RESULTS 

Cloning  of  mmGCNS.  In  order  to  study  the  function  of 
acetyltransferases  in  a  mammalian  system,  we  endeavored  to 
clone  mouse  GCN5  homologs.  First  we  generated  a  fragment 
of  the  mmGCN5  cDNA  using  a  nested  PCR  strategy  employ¬ 
ing  degenerate  primers  homologous  to  conserved  regions  of 
the  yeast  GCN5  and  the  Tetrahymena  p55  genes.  To  further 
enhance  the  probability  of  identifying  GC/V5-related  se¬ 
quences,  this  fragment  was  used  together  with  a  human  GCN5 
EST  to  screen  a  13.5-dpc  mouse  embryonic  cDNA  library 
under  conditions  of  low  stringency  (11).  Multiple  positive 
clones  were  identified,  and  upon  sequencing,  these  were  found 
to  contain  open  reading  frames  predicted  to  encode  proteins 
with  significant  homology  to  either  hsGCN5  or  hsP/CAF  (Fig. 

i). 

One  cDNA  clone  contained  an  open  reading  frame  encod¬ 
ing  756  amino  acids,  and  the  C-terminal  portion  of  this  pre¬ 
dicted  amino  acid  sequence  exhibited  98%  identity  with  the 
reported  hsGCN5  sequence,  but  only  71%  homology  to  the 
hsP/CAF  sequence,  over  the  length  of  the  predicted  proteins 
(Fig.  2A).  We  tentatively  concluded  that  this  cDNA  clone 
likely  contains  the  mmGCN5  gene,  as  confirmed  below. 

We  next  used  a  fragment  from  the  5'  end  of  this  clone  to 
screen  a  library  of  mouse  genomic  sequences.  Three  different 
clones  were  isolated,  and  restriction  analysis  and  sequencing 
indicated  that  all  three  clones  harbored  the  entire  mmGCN5 
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1  50 

MAEPSQAPNP  VPAAQPRPLH  SPAPAPTSTP  APSPASASTP  ApTPAPAPAP 

- —MAE  AG  GAGS  PALP  PA  pIhGSPRTLA 

51  *'  100 

AAAPAGSTGS  GGAGVGSGGD  PARfgLSQQQ  gASQRf&QVg  GhfMMLgK 
TAAGSSASCG  PATAVAAAGT  AEGpGGGGSA  RIAVKKAQLS  SAPRAKKLEK 
101  149 

LGVFSACKAN  ETCKCNGWKN  PKP.PTAfRM  DLQQPAANLS  ElSSSCeBpE 
LGVYSACKAE  gSCKCNGWKN  PNgslfplRG  ILQQIIVsIt  IslRgCSHAp 
ISO  199 

ADHVSHLENV  SEDEINRLLG  MVVDVENLFM  SVHKEEDTDT  KQVYFYLFKL 
AAHVSHLENV  SEEEMDRLLG  IVLDVEYLFT  CVHKEEDADT  KQVYFYLFKL 

200  '  ’  "  . . *  *  “  248 

LRKCILQMTR  PVVEGSL.GS  fPlfKPN fEQ  gfLlf^QTKF  §HlAPREKQ”f 
fcRKSYLQRGK  IvVEGSLEKK  SpFEKPsJEQ  |VNNFVQYKF  |HLPSkIrQT 
24  9  ’  '  .  ™  *  298 


MFELSKMFLL  CLNYWKLETP  A§fIqI§QSE  gVAT^KVNY'f  glLCYCH^PQ' 
TIELAKMFLN  RINYWHLEAP  SQRRLRSPND  DISGYKENYT  RWLCYCNVPQ 
299  . ~  3  348 


SC  DSL  PR  YET  ThVFGRSLLR  SIFTVTgRQL  pgKFgvgKDtf  pVPEKRfiriL 
FCDSLPRYET  TKVFGRTFVR  SVFTIMRRQL  LEQARQEKDK  LPLgKRTLIL 

34  9 .  394 

- ML  gEEIYGAWSP  fWESGpTMPP  |E - |TpL  gPRlASVSAA 

THFPKFLSML  gEEIYGA&SP  WeSGStMPP  Se....GTQL  VPRPATVSAT 
THFPKFLSML  EEEVYSQNSP  iWDQDfLSAS  |RTSPl|i|t  fes|PVTGTA 

395  .  ’  .  4  41 

VVPSTPIFSP  SMGGGSNSSL  SLDgAGAlPM  g.pKRTLPE  NLTLlDpRL 
WPS...FSP  SMGGGSNSSL  SLDSAGTEPM  PAGEKRKLPE  NLTLEDAKRL 
LFSSNSTSHE  QINGGRTSPG  CRgIsIlIaN  | . jSEKRKMNN  SHAPiEAKRS 

442  "  _ 491 

RVMGDIPMEL  VNEVMLTITD  PAAlMLGPETS  LLSANAARDE  T&RLEERRGI 
RVMGDIPMEL  VNEVMLTITD  PAAMLGPETS  LLSANAARDE  TARLEERRGI 
RVMGDIPVEL  INEVMSTITD  PAG MLGPETN  fIISALSARDE  aARLEERRGV 

T$2 -  541 

IEFHVIGNSL  TPKANRRVLL  WLVGLQNVFS  HQLPRMPKEY  IARLVFDPKH 
IEFHVIGNSL  TPKANRRVLL  WLVGLQNVFS  HQLPRMPKEY  IARLVFDPKH 
IEFHVVGNSL  NQKPlsIKKlLM  WLVGLQNVFS  &QLPRMPKEY  8  TfeiLVFDPKH 

542  .  ' .  * .  591 

KTLALIKDGR  VIGGICFRMF  PTQGFTEIVF  CAVTSNEQVK  GYGTHLMNHL 
KTLALIKDGR  VIGGICFRMF  iTQG:FTEIVF  CAVTSNEQVK  GYGTHLMNHL 
KTLALIKDGR  VIGGICFRMF  PSQAFTEIVL  CAVTSNEQVK  GYGTHLMNHL 

592  I  ’ .  ”  “  5TT 

KEYHIKHNIL  YFLT YADEYA  TGYFKKQGFS  iDlKVPKSRY  LpIKDYEG'A 
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KEYHIKHEIL  NFLT YADEYA  IGYFKKQGFS  KEIKIPKTKY  VGYIKDYEGA 

642  ”  §9i 

TLMECELNPR  IPYTELSHYl  jKKQKEIIKKL  IERKQAQIRK  VYPGLSCFKE 
TLMECELNPR  IPYTELSHIT  KKQKEIIKKL  IERKQAQIRK  VYPGLSCFKE 
TLMGCELNPQ  IPYTEFSVII  KKQKEIIKKL  IERKQAQIRK  VYPGLSCFKD 

692  . * . ~ .  741 

Gvrqipvesv  pgiretgwkp  LjGKEKGpLi  IpdOPtElk  |LlA@IKSp 
GVRQIPVESV  PGIRETGWKP  LGKEKGRELk  BpdQLYTTLK  SfLLAblKSHP 
GVRQIPIESI  PGIRETGWKP  SjGKEKSfeEPk  fe'PEQLYSTLK  {MQgVKNfiP 


742  793_ 

sIawpfmepvk  KSEAPDYYEV  IRFPIDLKTM  TERLRSRYYV  TRKLFVADLQ 
£ AWPFMEPVK  KSEAPDYYEV  IRFPIDLKTM  TERLRSRYYV  TR&LFVADLQ 
IN  AWPFMEPVK  RTEAPGYYEV  IRFPMDLKTM  SERLRNRYYV  SKKLFMADLQ 
792  37  830 

RVIANCREYN  PPDSEYCRCA  gALEKFRYff  LKEG^bWK 
RVIANCREYN  PPNSEYCRCA  SALKKFFYFK  LKEGGLIDK 
RVFTNCKEYN  PPpSEYYKCA  SILeKFFFSK  IREAGLIDK 
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FIG.  1.  Alignment  of  the  mmGCNS,  mmP/CAF,  and  reported  hsGCNS  amino  acid  sequences.  Identical  amino  acids  are  shaded.  Amino  acid  deletions  are  indicated 
with  a  dotted  line.  The  locations  of  the  histone  acetyltransf erase  (HAT)/acetyl  coenzyme  A  binding  regions  (20,  38)  and  the  bromodomain  motif  (17)  are  indicated. 
The  full  bromodomain  likely  encompasses  amino  acids  363  to  472  of  hsGCN5  (19). 
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Kozak  consensus:  -CCAUGG 
o  - 

mouse  GCN5  start  site:  GCCAUGG 

reported  human  GCN5  start  site:  TCCAUGC 

FIG.  2.  Comparison  of  GCN5  and  P/CAF  sequences  across  species.  (A)  Schematic  comparisons  of  mmGCN5  and  mmP/CAF  sequences  with  GCN5  and  P/CAF 
proteins  from  other  species.  Published  sequences  were  obtained  by  searching  the  PIR-Protein  and  SWISS-PROT  databases.  Positions  of  the  putative  catalytic  domains 
and  the  bromodomains  are  indicated  above  the  diagram.  Percent  identities  between  proteins  are  indicated  on  the  right.  yGcn5p,  yeast  Gcn5p;  HAT,  histone 
acetyltransf erase;  aa,  amino  acids.  The  dotted  box  indicates  the  existence  of  a  predicted  extended  amino-terminal  region  in  the  hsGCN5  protein.  (B)  Comparison  of 
the  Kozak  consensus  sequence  for  translation  start  sites,  the  predicted  mmGCN5  translation  start  sites,  and  the  previously  reported  hsGCN5  translation  start  site  (41). 
The  underlined  AUG  is  the  codon  for  the  initiator  methionine.  The  A  of  the  AUG  is  designated  +1.  In  the  consensus  sequence,  the  nucleotides  at  positions  —3  and 
+4  have  the  greatest  impact  on  translation  efficiency;  97%  of  vertebrate  mRNAs  have  a  purine  (A  or  G,  preferably  A)  in  position  -3,  and  46%  have  a  G  in  position 
+4  (21,  22). 


gene.  Comparison  of  the  genomic  and  cDNA  clones  of 
mmGCN5  revealed  that  the  cDNA  clone  isolated  as  described 
above  actually  lacked  the  first  74  amino-terminal  codons  and 
that  the  mmGCN5  gene  is  divided  into  19  exons  and  contains 
relatively  small  (85-bp  to  1-kb)  introns  (Fig.  3A).  We  inserted 
sequences  from  the  genomic  clone  containing  the  missing  ami¬ 
no-terminal  codons  into  the  cDNA  clone  to  generate  a  full- 
length  (encoding  830  amino  acids)  recombinant  mmGCN5 
cDNA. 

The  two  previously  reported  hsGCN5  sequences  differ  in  the 
position  of  the  initiating  methionine,  such  that  one  reported 
sequence  contains  49  additional  amino-terminal  amino  acids 
relative  to  the  other  (7, 41).  The  mmGCN5  open  reading  frame 
also  encodes  these  additional  amino  acids,  but  the  open  read¬ 
ing  frame  is  further  extended  for  some  distance  upstream  of 
these  sequences,  potentially  encoding  356  additional  amino 
acids.  The  context  of  the  predicted  translation  initiation  site  in 
this  extended  region  of  mmGCN5  matches  well  the  Kozak 
consensus  sequence  (Fig.  2B)  (21,  22).  Moreover,  the  amino 
acids  in  this  amino-terminal  extension  exhibit  more  than  66% 
identity  to  sequences  in  the  corresponding  regions  of  both 


mouse  (see  below)  and  human  P/CAF,  and  the  length  of  this 
extended  region  is  similar  to  that  of  the  P/CAF  proteins.  These 
data  indicate  that  mmGCN5  encodes  a  protein  that  is  very 
homologous  to  yeast  Gcn5p  and  is  almost  identical  to  the 
previously  reported  hsGCN5  but  that  contains  an  extended 
N-terminal  domain  homologous  to  P/CAF  in  both  size  and 
sequence. 

Incomplete  splicing  might  yield  a  shorter  GCN5  protein  in 
mouse  and  human  cells.  We  were  interested  in  determining 
the  basis  of  the  incongruity  in  size  between  mmGCN5  and  the 
reported  human  cDNA.  Inspection  of  the  mmGCN5  genomic 
sequence  revealed  the  presence  of  an  intron  (intron  6  in  Fig. 
3 A)  10  bp  upstream  of  the  previously  reported  upstream-most 
hsGCN5  translation  initiation  site  (41).  Sequences  highly  sim¬ 
ilar  (91%  identical)  to  these  intronic  sequences  are  also 
present  in  the  predicted  5'  untranslated  region  of  the  reported 
hsGCN5  cDNA  but  are  absent  in  the  mouse  cDNA  we  isolated 
as  described  above.  These  comparisons  suggest  either  that  the 
mouse  and  human  GCN5  genes  are  subject  to  differential  splic¬ 
ing  events,  in  which  this  intron  is  either  removed  (mouse)  or 
retained  (human),  or  that  the  previously  identified  human 
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Et-Br  stained  gel  cDNA  probe  Intron  probe  A 


LIL  THF  PK  * 

CCTCATCCTC  ACTCACTTCC  CCA3GTAAGG  CTCAGTCTGG  CCTCCTGGGA  TTTTACCCCA  AATTCATGTC 

Exon  6  , .  . 

oligo  probe  A 

CTTGCTGTTG  TCCCCTTTTT  TCCAGGAAGG  CTTCCTGGAT  TGGTCCCTCC  TCTCCCTCCA  TGGGCCTTCT 

Intron  6  ^ 

GGGATCTGGG  TGTCTACCTG  GCAGATTTGC  CCATGGCCCA  GAAGCTACTT  GCTAlGTATGT  GCTAGTCTGG 


GGATGGCAG 

Exon  7 


TACATGGCCC  .  CTCCCTACAG  lATTCCTGTCC  ATGCTTGAGG  AAGAGATCTA 


Intron  7 


Exon  8 


FIG.  3.  (A)  Genomic  structure  of  mmGCN5.  Exons  in  the  mmGCN5  gene  are  shown  in  boxes,  while  introns  are  represented  by  the  intervening  lines.  The  thick  line 
indicates  the  intron  that  is  retained  in  some  alternatively  spliced  variants  and  which  is  homologous  with  sequences  found  in  the  previously  reported  5'  untranslated 
region  of  the  hsGCN5  cDNA.  The  asterisks  indicate  in-frame  stop  codons  that  would  prevent  translation  of  the  full-length  protein.  Exon  7  is  the  exon  that  is  skipped 
in  the  mouse  cDNA.  Exon  8  is  the  first  coding  exon  in  the  reported  hsGCN5  cDNA.  The  positions  of  the  translation  start  codon  (ATG)  and  the  termination  codon 
(TAG)  are  indicated.  (B)  Coexistence  of  multiple  forms  of  mammalian  GCN5  transcripts.  Left  panel,  DNase  I-treated  RNAs  from  mouse  kidney,  ovary,  and  embryo 
and  human  HeLa  and  hepatoma  cell  lines  were  RT-PCR  amplified  by  using  primers  A  and  B  shown  in  the  diagram  on  the  right.  RT-PCR  products  were  resolved  on 
an  ethidium  bromide  (Et-Br)-stained  agarose  gel.  Mouse  genomic  DNA  was  also  amplified  under  the  same  conditions.  A  prevalent  product  corresponding  to  the  size 
of  the  mouse  cDNA  (without  introns  6  and  7)  was  amplified  from  all  of  the  RNA  samples,  while  other,  larger  products  corresponding  to  the  size  of  the  reported  human 
cDNA  (containing  intron  6  but  not  intron  7)  were  barely  detected.  Middle  panel,  the  RT-PCR  products  were  transferred  to  nylon  membrane  and  hybridized  with 
mmGCN5  cDNA  sequences.  Right  panel,  the  same  blot  from  the  middle  panel  was  stripped  and  rehybridized  with  a  probe  (probe  A)  specific  to  the  conserved  intron 
6.  (C)  The  RT-PCR  products  described  above  were  gel  purified  and  then  amplified  by  PCR  with  a  nested  pair  of  primers.  The  PCR  products  were  then  subjected  to 
DNA  sequencing.  The  nucleotide  sequence  of  the  317-bp  fragment  is  shown.  The  smaller  (292-bp)  fragment  has  the  same  sequence  as  the  larger  fragment  except  that 
it  lacks  exon  7  sequences.  Exons  are  boxed,  and  introns  are  numbered  underneath.  In-frame  stop  codons  are  in  boldface  and  marked  by  asterisks.  The  mmGCNS  reading 
frame  separated  by  intron  6  and  exon  7  is  shown.  The  initiator  methionine  codon  for  the  reported  hsGCNS  (41)  protein  is  in  boldface  and  underlined. 
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FIG.  4.  Detection  of  both  long  and  short  GCN5  proteins.  (A)  Left  panel,  protein  extracts  were  prepared  from  U20S  cells  or  HeLa  cell  nuclei  and  probed  with 
polyclonal  antibodies  to  hsP/CAF  or  hsGCNS,  as  indicated.  Right  panel,  protein  extracts  were  prepared  from  12.5-dpc  mouse  embryos  or  HeLa  cells  and  probed  with 
the  hsGCN5  antibody.  (B)  GCN5  proteins  were  immunoprecipitated  (IP)  from  12.5-dpc  mouse  embryos  and  then  probed  with  hsGCN5  antibody.  An  unrelated  HIRA 
polyclonal  antibody  was  used  as  a  negative  control. 


cDNA  sequence  is  incomplete.  Interestingly,  a  conserved,  in¬ 
frame  stop  codon  is  found  near  the  beginning  of  intron  6,  and 
retention  of  this  intron  would  prevent  translation  of  the  larger 
protein  in  both  mouse  and  human  cells,  perhaps  yielding  a 
smaller  protein  with  a  size  corresponding  to  that  previously 
predicted  for  hsGCN5. 

To  investigate  the  possibility  of  alternative  (or  incomplete) 
splicing  of  mouse  and  human  GCN5  transcripts,  we  performed 
RT-PCR  on  total  RNA  isolated  from  human  HeLa  cells,  hu¬ 
man  hepatoma  cells,  mouse  kidney,  mouse  ovary,  and  a  13.5- 
dpc  mouse  embryo.  All  RNAs  were  treated  with  RNase-free 
DNase  I  before  RT-PCR  to  remove  any  genomic  DNA  from 
the  samples.  An  mmGCN5  genomic  DNA  clone  was  used  in  a 
separate  reaction,  as  a  positive  control  for  the  presence  of  the 
intron  sequences.  Two  primers  corresponding  to  conserved 
sequences  in  exons  6  and  8,  which  flank  introns  6  and  7  (Fig. 
3A  and  B),  were  used  for  the  amplification.  The  RT-PCR 
products  were  separated  on  an  agarose  gel,  transferred  to  a 
membrane,  and  then  probed  sequentially  with  mmGCNS 
cDNA  sequences  or  intron  6  sequences. 

A  predominant  RT-PCR  product  of  a  size  corresponding  to 
the  spliced  cDNA  (lacking  the  intron)  was  amplified  from 
mouse  embryonic,  kidney,  and  ovarian  RNAs  (lower  band  in 
Fig.  3B).  As  expected,  this  product  was  significantly  smaller 
(126  bp)  than  the  amplification  product  from  the  genomic 
DNA  (about  1  kb),  which  contains  introns  6  and  7.  This  small 
product  hybridized  to  the  mmGCNS  cDNA  sequences  but  not 
to  the  intron  6  probe,  consistent  with  the  removal  of  these 
intronic  sequences  by  splicing.  In  contrast,  two  less  abundant, 
closely  spaced  bands  were  detected  by  both  the  cDNA  and  the 
intron  6  probes.  An  intron  7  probe  hybridized  only  to  the 
genomic  DNA  but  failed  to  detect  any  of  the  RT-PCR  prod¬ 
ucts  (data  not  shown),  suggesting  that  intron  7  had  been  re¬ 
moved  in  all  of  the  transcripts.  Sequencing  of  the  larger,  closely 
spaced  RT-PCR  products  revealed  that  they  represent  two 
alternatively  spliced  variants  of  mmGCN5  (Fig.  3C).  Both  of 
these  variants  retained  intron  6,  but  one  also  contained  a  novel 


25-bp  exon  (exon  7)  located  between  introns  6  and  7.  Intron  7 
was  removed  from  both  of  these  alternatively  spliced  products, 
bringing  the  stop  codons  in  intron  6  to  a  position  just  upstream 
of  the  ATG  sequence  corresponding  to  the  previously  pre¬ 
dicted  translation  start  site  of  hsGCN5.  Together  these  data 
indicate  that  the  predominant  form  of  the  mouse  cDNA  is 
completely  spliced,  lacks  these  stop  codons,  and  therefore  is 
predicted  to  encode  the  longer  version  of  GCN5.  However,  the 
two  minor  RT-PCR  products  that  we  observed  might  encode 
shorter  GCN5  proteins,  consisting  of  the  amino-terminal, 
P/CAF-like  domain  in  isolation  or  of  the  C-terminal  domain, 
which  is  most  similar  to  yeast  GCN5. 

RT-PCR  of  total  RNA  from  human  cells  revealed  a  similar 
mixture  of  completely  and  incompletely  spliced  RNAs.  For 
example,  two  RT-PCR  products  were  generated  from  the  hu¬ 
man  HeLa  cell  and  hepatoma  cell  RNAs.  The  size  of  the  more 
abundant,  smaller  product  again  is  consistent  with  a  spliced 
cDNA  lacking  sequences  homologous  to  the  mouse  intron  6 
and  exon  7,  and  this  product  hybridizes  only  to  cDNA  se¬ 
quences.  The  less  abundant,  larger  product  hybridizes  to  both 
intron  and  exon  sequences  (Fig.  3B,  middle  panel).  We  suggest 
that  the  longer  product  likely  corresponds  to  the  hsGCNS 
cDNA  sequences  previously  reported,  whereas  the  more  prev¬ 
alent,  shorter  form  represents  a  spliced  product  predicted  to 
encode  a  longer  protein  analogous  to  that  encoded  by  the 
mouse  cDNA  isolated  as  described  above. 

Long  GCN5  proteins  are  present  in  both  human  and  mouse 
cells.  To  identify  the  size  of  the  native  mammalian  GCN5 
protein(s),  total  cell  extracts  prepared  from  a  12.5-dpc  mouse 
embryo  or  human  HeLa  cells  were  probed  with  a  polyclonal 
serum  raised  against  the  previously  described  hsGCN5  (gen¬ 
erously  provided  by  Shelley  Berger,  Wistar  Institute).  The 
hsGCN5 -specific  antiserum  detected  a  98-kDa  protein  in  the 
HeLa  cell  nuclear  extracts,  consistent  with  the  predicted  size  of 
the  full-length  GCN5  protein  containing  the  extended  amino- 
terminal  region  (Fig.  4 A,  left  panel).  To  ensure  that  this  band 
corresponded  to  mmGCN5  and  that  the  hsGCN5  antibody  did 
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not  cross-react  with  P/CAF,  we  compared  the  relative  signals 
obtained  with  the  hsGCNS  antibody  and  a  P/CAF  antibody 
(generously  provided  by  Yoshihiro  Nakatani,  National  Insti¬ 
tutes  of  Health)  with  extracts  from  U20S  cells  or  HeLa  cells. 
The  P/CAF  antibody  recognized  a  single  band  in  the  U20S 
extract,  consistent  with  previous  reports  that  P/CAF  is  well 
expressed  in  these  cells  (41),  and  in  the  HeLa  cell  nuclear 
extract.  The  hsGCN5  antibody,  however,  did  not  recognize  any 
proteins  of  a  similar  size  in  either  extract  but  did  recognize  a 
prominent  band  of  —98  kDa  in  the  HeLa  cell  nuclear  extract. 
Therefore,  the  hsGCN5  antibody  does  not  appear  to  cross- 
react  significantly  with  PCAF,  and  we  conclude  that  the  98- 
kDa  protein  recognized  by  this  antibody  in  HeLa  cell  extracts 
is  GCN5. 

The  hsGCN5  antibody  also  recognized  a  faint  60-kDa  band 
(lower  arrow  in  right  panel  of  Fig.  4 A)  in  the  HeLa  cell  ex¬ 
tracts,  close  to  the  predicted  size  of  the  shorter  GCN5  protein 
described  previously  (38)  and  above.  Thus,  both  the  long  and 
short  forms  of  GCN5  appear  to  be  expressed  in  these  cells,  but 
the  longer  form  appears  to  be  predominant.  Interestingly,  the 
long  form  of  GCN5  was  the  only  form  detected  in  mouse 
embryo  extracts.  The  expression  of  GCN5  protein  in  the  em¬ 
bryonic  extracts  is  consistent  with  high  levels  of  GCN5-specific 
RNA  detected  in  these  tissues  (see  Fig.  5).  Moreover,  since 
only  very  low  levels  of  P/CAF  RNA  were  detected  at  this  (or 
any)  stage  of  mouse  embryogenesis  (data  not  shown  and  see 
Fig.  5),  these  data  further  support  our  conclusion  that  the 
hsGCNS  antibody  recognizes  mmGCN5  rather  than  mmP/ 
CAF.  Neither  the  long  nor  the  short  form  of  GCN5  was  de¬ 
tected  by  control,  preimmune  serum  in  either  the  mouse  or 
human  extracts  (data  not  shown). 

We  also  used  the  anti-hsGCN5  serum  to  immunoprecipitate 
GCN5  proteins  from  the  mouse  embryo  extract.  Precipitated 
proteins  were  then  detected  by  Western  blotting  with  the  same 
serum.  Again,  a  98-kDa  protein  was  detected  by  the  hsGCN5 
antibody  but  not  by  a  control  rabbit  serum  (Fig.  4B).  Unfor¬ 
tunately,  the  shorter  form  of  GCN5,  if  it  was  present,  would 
comigrate  with  the  immunoglobulin  G  band  and  thus  could  not 
be  detected  by  this  approach.  Nevertheless,  these  experiments 
confirm  the  presence  of  the  longer  GCN5  protein  in  mouse 
embryos. 

Cloning  of  mmPICAF.  A  second  GCAU-related  cDNA  clone 
that  contained  a  high  degree  of  similarity  to  hsPICAF  was 
isolated  in  our  screen  of  the  mouse  cDNA  library.  Since  all 
initial  clones  appeared  to  be  incomplete,  containing  an  867-bp 
fragment  of  the  cDNA  (relative  to  the  human  sequence),  a 
second  library  was  screened  by  using  GeneTrapper  technology. 
Multiple  full-length  cDNAs  containing  an  open  reading  frame 
predicted  to  encode  813  amino  acids  were  obtained.  This  open 
reading  frame  exhibited  93%  identity  to  the  hsP/CAF  cDNA 
sequence  but  only  75%  identity  to  the  reported  hsGCNS 
cDNA  sequence  (41).  We  therefore  designated  this  clone 
mmPICAF.  Both  the  mmGCN5  and  the  mmP/CAF  sequences 
possess  predicted  catalytic  domains  and  bromodomains  iden¬ 
tified  in  a  number  of  recently  identified  histone  acetyltrans- 
ferases,  including  several  highly  conserved  amino  acids  near 
the  putative  catalytic  center  (Fig.  1). 

Using  a  fragment  from  the  5'  region  of  the  mmPICAF  cDNA 
as  a  probe,  we  identified  multiple  clones  from  a  library  of 
mouse  genomic  sequences  that  contained  PI  CAF  sequences. 
Four  of  these  contained  different  portions  of  the  cDNA  se¬ 
quence.  These  clones  indicate  that  in  contrast  to  the  mmGCN5 
gene,  which  contains  small  introns  (a  few  hundred  base  pairs 
each),  the  mmP/CAF  gene  contains  very  large  introns  (16  to  20 
kb).  Because  of  these  large  introns,  we  have  not  completed 
cloning  of  mmPICAF  genomic  sequences. 
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FIG.  5.  Ubiquitous  and  complementary  expression  of  mmGCN5  and  mmP/ 
CAF.  (Top  panel)  Total  RNA  was  isolated  from  various  mouse  tissues  and 
embryos  as  indicated.  Northern  blot  hybridization  was  performed  with  a  mixture 
of  mmGCNS  and  mmPICAF  cDNA  probes.  Two  transcripts  were  detected  and 
are  indicated  by  arrows  on  the  right.  The  identities  of  the  transcripts  were 
confirmed  by  Northern  blot  hybridization  with  single  GCN5  or  PICAF  probes 
(data  not  shown).  Positions  of  RNA  molecular  size  standards  (in  kilobases)  are 
shown  on  the  left.  (Bottom  panel)  Ethidium  bromide-stained  gel  showing  18S 
rRNAs. 


Interestingly,  several  clones  identified  in  our  genomic 
screens  apparently  contain  a  PfCAF  pseudogene.  No  intronic 
sequences  are  present  in  these  clones,  and  several  base  substi¬ 
tutions,  relative  to  the  cDNA  sequence,  are  scattered  through¬ 
out  the  predicted  coding  region  of  the  pseudogene.  RT-PCR 
analysis  indicates  that  the  pseudogene  is  not  expressed  in  sev¬ 
eral  mouse  tissues  examined,  including  brain,  eye,  heart,  lung, 
liver,  kidney,  thymus,  spleen,  fat,  diaphragm,  small  intestine, 
ovary,  testis,  or  a  13.5-dpc  embryo  (data  not  shown). 

Ubiquitous  but  complementary  expression  of  mmGCNS  and 
mmPICAF.  To  examine  and  compare  the  expression  of 
mmGCN5  and  mmP/CAF,  total  RNA  was  extracted  from  var¬ 
ious  mouse  tissues,  subjected  to  denaturing  electrophoresis, 
transferred  to  a  membrane,  and  then  probed  with  mmGCN5 - 
or  mmP/C4F-specific  sequences. 

A  single  transcript  of  3.3  kb  was  detected  in  all  tissues  with 
the  GCN5  probe,  consistent  with  size  of  the  cDNA  clone  we 
isolated.  Similarly,  a  single,  ubiquitous  transcript  was  detected 
with  the  P/CAF  probe,  and  the  size  of  this  RNA,  4.4  kb,  is 
similar  to  that  of  the  P/CAF  cDNA  that  we  isolated.  Interest¬ 
ingly,  the  P/CAF  RNA  always  exhibited  a  broader  banding 
pattern  than  did  the  GCN5  RNA.  These  two  RNAs  were 
clearly  distinguished  from  one  another  when  probed  on  the 
same  blot,  and  a  differential  pattern  of  expression  was  detected 
(Fig.  5).  For  example,  the  ratio  of  mmGCN5  to  mmP/CAF 
expression  is  higher  in  brain,  thymus,  spleen,  testis,  and  13.5- 
dpc  embryonic  tissue,  while  this  ratio  is  much  lower  in  heart, 
liver,  kidney,  and  skeletal  muscle.  Westerfi -blot  analysis  of 
GCN5  protein  levels  (with  the  polyclonal  antiserum  to  hs¬ 
GCNS  described  above)  in  various  mouse  tissues  confirmed 
the  general  pattern  of  expression  indicated  by  this  RNA  anal¬ 
ysis  (data  not  shown). 

Chromosomal  locations  of  the  mmGCN5  and  mmPICAF 
genes.  The  chromosomal  location  of  the  mmGCN5  gene  was 
mapped  by  standard  linkage  analysis  with  the  Jackson  Labo¬ 
ratory  interspecific  backcross  panel  (C57BL/6Jei  X  SPRET/ 
Ei)Fj  X  SPRET/Ei,  also  known  as  Jackson  BSS  (33). 
mmGCN5  mapped  cleanly  to  a  distal  region  on  chromosome 
11  and  cosegregated  tightly  with  BRCA1 ,  as  well  as  with  a 
number  of  other  genes  previously  mapped  to  that  locus  (data 
not  shown,  but  raw  data  from  the  Jackson  Laboratory  are 
available  at  http://www.jax.org/resources/documents/cmdata). 
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FIG.  6.  Acetylation  of  histone  H3  synthetic  peptides  by  mmGCN5  and  mmP/CAF.  (A  and  B)  Results  of  acetyltransferase  assays  with  recombinant  full-length 
mmGCN5  and  synthetic  peptides  corresponding  to  the  amino-terminal  tail  of  histone  H3.  (C  and  D)  Results  of  peptide  assays  with  recombinant  mmP/CAF.  Peptides 
were  either  unacetylated  (un)  or  synthesized  with  acetyl-lysine  residues  at  either  K9  and  K.18  (di9,18)  or  K9  and  K14  (di9,14)  in  H3.  Vector  indicates  assay  of  a  control 
extract,  made  from  bacteria  transformed  by  the  recombinant  vector  without  an  acetyltransferase  insert,  subjected  to  the  His  tag  purification  procedure. 


Interestingly,  the  hsGCN5  gene  was  recently  mapped  by  fluo¬ 
rescent  in  situ  hybridization  analysis  to  a  syntenic  region  of 
human  chromosome  17  (9)  and  was  also  found  to  cosegregate 
with  human  BRCA1. 

The  location  of  mmP/CAF  was  mapped  in  a  similar  fashion, 
using  the  same  backcross  panel.  In  this  case  we  used  a  probe 
specific  for  intronic  sequences  to  ensure  that  we  mapped  the 
authentic  mmP/CAF  gene  and  not  the  P/CAF  pseudogene. 
This  analysis  indicated  that  mmP/CAF  is  located  32  centimor- 
gans  from  the  centromere  of  mouse  chromosome  17  and  that 
it  cosegregates  with  the  DNA  marker  D17Bir8  (see  www  ad¬ 
dress  above). 

mmGCN5  encodes  a  histone  acetyltransferase  with  sub¬ 
strate  specificity  similar  to  that  of  P/CAF.  The  high  degree  of 
homology  between  the  mouse,  human,  and  yeast  GCN5  pro¬ 
teins  strongly  predicts  that  mmGCN5  and  mmP/CAF  will  ex¬ 
hibit  histone  acetyltransferase  activity.  We  confirmed  this  ini¬ 
tially  by  examining  the  activities  of  the  isolated,  conserved 
acetylase  domains  of  mmGCN5  and  mmP/CAF,  expressed  as 
recombinant  proteins  in  Escherichia  coli.  As  expected,  this 
domain  of  mmGCNS  was  quite  active  as  a  histone  acetylase, 
and  it  preferentially  acetylated  free  (nonnucleosomal)  histone 
H3,  and  to  a  lesser  degree  H4,  as  does  yeast  Gcn5p  (23)  and 
the  previously  reported  form  of  the  hsGCN5  protein  (41). 
Full-length  mmGCN5  and  mmP/CAF  recombinant  proteins 
(also  expressed  in  bacteria)  exhibited  this  same  substrate  spec¬ 
ificity  towards  free  histones  (Fig.  6  and  data  not  shown). 


To  determine  which  residues  of  histone  H3  were  acetylated 
by  mmGCN5,  we  performed  assays  with  synthetic  peptides 
corresponding  to  the  amino-terminal  tail  of  this  histone.  As 
expected,  we  found  that  the  full-length  GCN5  protein  effi¬ 
ciently  acetylated  peptides  corresponding  to  the  first  20  amino 
acids  of  histone  H3  (Fig.  6 A  and  B).  This  domain  alone,  then, 
is  sufficient  for  binding  to  the  enzyme  and  subsequent  catalysis. 
However,  mmGCNS  could  not  acetylate  a  peptide  that  con¬ 
tained  acetyl-lysine  moieties  at  positions  9  and  14  (Fig.  6B), 
suggesting  that  one  or  both  of  these  lysines  may  be  a  target  site 
for  mmGCN5.  In  contrast,  mmGCN5  readily  acetylated  a  pep¬ 
tide  containing  acetyl-lysine  moieties  at  positions  9  and  18  (Fig. 
6A).  Taken  together,  these  data  suggest  that  K14  is  the  pre¬ 
ferred  acetylation  site  in  H3  for  mmGCN5.  Similar  assays 
performed  with  H4  peptides  indicate  that  K8  is  the  preferred 
site  of  acetylation  in  H4  (data  not  shown).  These  results  are 
consistent  with  the  site  specificity  determined  for  recombinant 
yeast  Gcn5p,  which  was  confirmed  by  protein  sequencing  of 
acetylated  histones  (23).  Importantly,  these  results  indicate 
that  the  extended  amino-terminal  domain  of  mmGCN5  does 
not  change  the  histone  or  lysine  residue  specificity  of  the  en¬ 
zyme. 

The  specificity  of  mmP/CAF  was  also  tested  with  the  peptide 
substrates.  In  all  respects,  mmP/CAF  exhibited  a  substrate 
specificity  identical  to  that  of  mmGCNS  (Fig.  6C  and  D). 

One  striking  difference  between  the  previously  reported, 
shorter  form  of  recombinant  hsGCNS  (or  yeast  GcnSp)  and 


'  Vol.  18,’  1998 


GCN5  DOMAIN  IMPORTANT  FOR  NUCLEOSOMAL  ACETYLATION  5667 


Free  histones  Nucleosomes 


FIG.  7.  Acetylation  of  nucleosomal  histones  by  mmGCN5  and  mmP/CAF. 
Acetyl  transferase  assays  were  performed  with  HeLa  cell  mononucleosomes  or 
free  histones  as  indicated,  and  an  aliquot  of  each  assay  mixture  was  resolved  on 
an  SDS-22%  polyacrylamide  gel.  Coomassie  blue-stained  gels  and  correspond¬ 
ing  autoradiographs  are  shown.  In  both  assays,  histones  H3  and  H4  were  acety- 
lated  by  the  recombinant  full-length  mmGCN5. 


recombinant  hsP/CAF  was  the  ability  of  P/CAF  to  acetylate 
nucleosomal  substrates  (23,  41).  Given  the  homology  between 
the  amino-terminal  portions  of  P/CAF  and  mmGCNS,  we 
asked  whether  the  full-length  recombinant  mmGCNS  could 


also  acetylate  histones  within  a  nucleosome.  We  found  that 
mmGCN5,  like  hsP/CAF,  can  acetylate  nucleosomal  H3  and, 
to  a  lesser  degree,  H4  (Fig.  7).  In  agreement  with  previously 
reported  results  (23,  41),  we  also  found  that  the  short  form  of 
mmGCNS  or  yeast  GcnSp  was  unable  to  acetylate  nucleosomes 
(data  not  shown).  These  results  suggest  that  one  function  of 
the  amino-terminal  domains  of  mammalian  GCN5  and  P/CAF 
may  be  to  facilitate  the  recognition  of  chromatin  templates. 

mmGCNS  and  mmP/CAF  both  interact  with  CBP  and  p300. 
hsP/CAF  interacts  with  CBP  and  p300  (41).  Given  the  similar¬ 
ity  between  mmGCN5  and  mmP/CAF,  we  examined  the  abil¬ 
ities  of  both  of  these  proteins  to  bind  to  CBP  or  p300  in  vitro 
(Fig.  8). 

Whole-cell  lysates  from  bacteria  expressing  fragments  of 
CBP  fused  to  GST  (fusion  constructs  were  kindly  provided  by 
Y.  Nakatani,  National  Institutes  of  Health)  (41)  were  mixed 
with  lysates  from  cells  expressing  the  amino-terminal  domain 
of  mmP/CAF,  the  amino-terminal  domain  of  mmGCNS,  or  the 
C-terminal  domain  of  mmGCNS.  The  CBP  fragments  (A  to  F) 
spanned  the  ADA2  homology  domain  and  extended  into  the 
transcriptional  activation  domain  (41).  A  fragment  of  p300 
(B')  homologous  to  the  B  fragment  of  CBP  was  also  tested. 
GST  fusion  proteins  were  purified  together  with  any  interact¬ 
ing  proteins  by  using  glutathione-Sepharose,  and  the  interact¬ 
ing  proteins  were  identified  by  Western  blotting  with  an  anti¬ 
serum  specific  for  the  six-histidine  tag  present  in  the 
recombinant  mmP/CAF  or  mmGCN5  protein. 
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FIG.  8.  mmGCN5  and  mmP/CAF  both  interact  with  CBP  and  p300  in  vitro.  (A)  Fragments  of  CBP  fused  to  GST  that  were  used  for  the  interaction  assays  in  panel 
C.  These  fragments  span  the  region  of  homology  to  ADA2  and  extend  into  the  transactivation  domain  of  CBP  (41).  (B)  Recombinant,  His-tagged  proteins  used  in  the 
interaction  assays  of  panel  C  were  resolved  by  SDS-polyacrylamide  gel  electrophoresis  and  probed  with  an  antibody  specific  for  the  His  tag.  The  amounts  shown 
represent  0.25  or  1%  of  the  protein  used  in  the  assay,  as  indicated.  (C)  GST-CBP  or  -p300  fusion  proteins  were  mixed  with  crude  bacterial  lysates  containing  P/CAF, 
GCN5,  or  HIRA  protein.  Interacting  proteins  were  recovered  by  using  glutathione-Sepharose  and  detected  on  Western  blots  with  the  anti-His  tag  antibody.  The  p300 
B'  fragment  is  homologous  to  the  CBP  B  fragment,  and  the  CBP  AB  fragment  is  missing  residues  1801  to  1851  (41). 
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The  amino-terminal  domain  of  mmP/CAF  selectively  bound 
to  fragments  A  and  B  of  CBP  and  the  corresponding  B'  frag¬ 
ment  of  p300.  In  some  experiments,  we  also  observed  binding 
to  the  D  fragment,  but  we  never  observed  binding  to  fragment 
C,  E,  or  F.  A  deletion  within  the  B  fragment  (AB)  of  CBP  that 
removed  residues  1801  to  1851  eliminated  binding.  This  pat¬ 
tern  of  binding  to  the  CBP/p300  fragments  is  extremely  similar 
to  that  previously  reported  for  hsP/CAF  (41),  as  expected. 

A  recombinant  form  of  hsGCNS,  which  lacked  the  amino- 
terminal  domain  reported  here  for  mmGCN5,  failed  to  bind 
CBP  or  p300  in  previous  experiments  by  Yang  et  al.  (41).  This 
form  of  hsGCNS  corresponds  to  the  C-terminal  region  of 
mmGCN5.  We  therefore  compared  binding  of  the  amino-ter¬ 
minal  and  the  C-terminal  halves  of  mmGCN5  to  the  GST-CBP 
and  -p300  fragments.  Surprisingly,  we  found  that  both  of  these 
mmGCN5  domains  bound  to  CBP  fragments  A  to  D,  with  little 
or  no  binding  to  fragment  E,  fragment  F,  or  the  AB  fragment. 
Both  the  amino-terminal  and  C-terminal  regions  of  mmGCN5 
also  bound  to  the  p300  B'  fragment.  The  amount  of  the  GST 
fusion  proteins  recovered  from  the  GST  columns  that  did  not 
exhibit  binding  to  the  GCN5  fragments  was  greater  than  or 
equal  to  that  of  the  GST  fusions  that  did  exhibit  binding  (data 
not  shown),  so  the  absence  of  binding  was  not  due  to  reduced 
amounts  of  the  E,  F,  or  AB  fragment.  In  addition,  the  selective 
binding  of  the  mmGCN5  peptides  to  CBP  fragments  A  to  D 
indicates  that  these  interactions  are  not  nonspecifically  medi¬ 
ated  by  the  GST  moiety,  since  this  moiety  is  also  present  in 
fragments  E,  F,  and  AB.  The  specificity  of  the  interactions  was 
further  tested  by  using  an  unrelated  protein,  HIRA,  which 
failed  to  bind  to  any  of  the  GST-CBP  or  -p300  fragments. 
Thus,  CBP  fragments  A  to  D  do  not  exhibit  general,  nonspe¬ 
cific  binding  to  random  proteins.  We  conclude  that  mmGCN5 
contains  two  distinct  CBP/p300  interaction  domains  and  that 
these  domains  interact  with  a  broader  region  in  CBP  than  does 
P/CAF.  Importantly,  our  finding  that  mmGCNS  and  mmP/ 
CAF  can  both  interact  with  CBP/p300  indicates  that  these 
proteins  are  very  similar  in  function  as  well  as  in  structure. 

DISCUSSION 

The  recent  identification  of  nuclear  histone  acetyltrans- 
ferases  has  directly  linked  chromatin  modification  with  tran¬ 
scriptional  regulation  (1,  5, 26,  29).  We  report  here  the  cloning 
of  mmGCN5  and  mmP/CAF  sequences.  We  find  that 
mmGCN5  differs  from  yeast  GCN5  and  the  previously  re¬ 
ported  hsGCN5  sequences  in  that  it  encodes  a  large  N-termi- 
nal  domain  similar  to  that  found  in  P/CAF.  Our  data  indicate 
that  hsGCNS  contains  this  extra  domain  as  well.  While  this 
domain  does  not  appear  to  affect  the  histone  specificity  of  the 
acetyltransferase,  it  does  afford  the  enzyme  the  ability  to  mod¬ 
ify  nucleosomal  substrates  in  vitro. 

In  vivo,  both  the  yeast  and  mammalian  enzymes  must  inter¬ 
act  with  and  modify  nucleosomal  histones.  In  yeast,  this  is 
accomplished  by  association  of  Gcn5p  into  high-molecular- 
weight  protein  complexes  that  can  modify  nucleosomes  and 
that  recognize  additional  histones  (14).  At  least  some  of  the 
Gcn5p-associated  proteins  are  conserved  in  higher  eukaryotes, 
and  Gcn5-Ada  complexes  have  been  identified  in  human  cells 
(7),  further  indicating  that  these  enzymes  serve  similar  func¬ 
tions  across  species.  We  scanned  the  yeast  genome  database  to 
determine  whether  a  protein  homologous  to  the  amino-termi¬ 
nal  domains  of  mmGCNS  or  mmP/CAF  might  exist  that  could 
be  a  component  of  the  Gcn5p-containing  complexes.  However, 
we  found  no  such  homologs. 

Interestingly,  a  single  GC/V5-related  gene  has  been  identi¬ 
fied  in  Drosophila.  This  gene  exhibits  ’ high  similarity  to  mam¬ 


malian  P/CAF  (34a)  and  encodes  the  extended  N-terminal 
domain.  Although  this  domain  is  apparently  not  needed  in 
yeast,  its  functions  are  not  restricted  to  mammals. 

Our  work  indicates  that  multiple  differentially  spliced  forms 
of  GCN5  transcripts  coexist  in  both  mouse  and  human  cells, 
which  may  generate  different  isoforms  of  GCN5  proteins.  Of 
course,  we  cannot  rule  out  the  possibility  that  the  less  abundant 
products  represent  incompletely  spliced  RNAs,  but  it  is  inter¬ 
esting  that  intron  6  and  the  stop  codons  therein  are  conserved 
between  human  and  mouse.  Since  we  detected  transcripts  con¬ 
taining  intron  6  in  both  mouse  and  human  tissues,  we  are 
intrigued  by  the  possibility  that  shortened  GCN5  proteins, 
containing  either  the  N-terminal  domain  alone  or  the  C-ter- 
minal  domain  alone,  may  provide  an  additional  level  of  regu¬ 
lation  of  GCN5  functions.  For  example,  we  detected  the  full- 
length  GCN5  protein  in  mouse  embryo  extracts  and  some 
human  cells,  but  we  detected  a  shorter,  less  abundant  protein 
in  HeLa  cells  in  addition  to  the  full-length  protein.  It  will  be 
especially  interesting  to  determine  whether  various  forms  of 
GCN5  proteins  are  differentially  regulated  in  different  cell 
types  or  at  different  developmental  stages. 

The  long  form  of  mmGCN5  is  very  similar  to  mmP/CAF  in 
structure,  in  acetyltransferase  activity  and  substrate  specificity, 
and  in  interactions  with  CBP/p300.  Additional  experiments  are 
needed  to  determine  whether  these  two  proteins  are  function¬ 
ally  redundant  in  vivo.  Even  if  GCN5  and  P/CAF  perform  the 
same  functions,  they  might  be  utilized  at  different  developmen¬ 
tal  stages  or  in  different  cell  types  or  tissues.  The  similarity 
between  these  proteins  is  somewhat  reminiscent  of  that  be¬ 
tween  CBP  and  p300.  These  two  proteins  also  appear  to  be 
functionally  equivalent  in  vitro,  but  mutations  in  p300  and  CBP 
cause  different  phenotypes  (27, 30),  indicating  that  the  proteins 
are  not  functionally  redundant  in  vivo.  It  will  be  interesting  to 
determine  whether  the  same  is  true  for  mmGCN5  and  mmP/ 
CAF. 

Several  histone  acetyltransferases,  including  p300/CBP  and 
P/CAF,  have  been  implicated  in  growth  control  and  tumori- 
genesis  (30,  32,  41).  p300/CBP  physically  interacts  with  the 
tumor  suppressor  p53  and  potentiates  sequence-specific  DNA 
binding  and  transactivation  by  p53  through  acetylation  of  its 
C-terminal  domain  (15).  Moreover,  mutations  in  p300  have 
been  found  in  certain  colorectal  and  gastric  cancers  (27).  CBP 
mutations  are  also  involved  in  the  etiology  of  certain  acute 
myeloid  leukemias  and  Rubinstein-Taybi  syndrome  (30),  a  de¬ 
velopmental  disorder  with  a  high  incidence  of  neoplasms.  In 
addition,  P/CAF  counteracts  the  transforming  activity  elicited 
by  oncoprotein  E1A,  and  overexpression  of  P/CAF  has  been 
shown  to  inhibit  cell  cycle  progression  (41).  Therefore,  histone 
acetyltransferases  have  been  postulated  to  be  negative  regula¬ 
tors  of  cell  growth  and,  possibly,  tumor  suppressors.  We  (this 
study)  and  others  (9)  have  found  that  GCN5  cosegregates  with 
the  tumor  suppressor  BRCA1  gene  (1-centimorgan  interval)  in 
a  highly  syntenic  region  in  mouse  (chromosome  11)  and  hu¬ 
man  (chromosome  17).  Interestingly,  loss  of  heterozygosity  on 
human  chromosome  17  is  a  frequent  genetic  alteration  in  spo¬ 
radic  breast  and  ovarian  cancers,  where  mutations  in  BRCA1 
and  BRCA2  are  rarely  found  (28,  35).  Indeed,  a  novel  tumor 
suppressor  gene  involved  in  these  cancers  has  been  postulated 
to  be  located  adjacent  to  the  BRCA1  locus  (28,  35).  GCN5  may 
provide  an  attractive  candidate  for  this  novel  tumor  suppres¬ 
sor.  The  isolation  and  characterization  of  mmGCNS  and  mmP / 
CAF  reported  here  should  facilitate  further  study  of  the  roles 
of  these  genes  and  of  histone  acetylation  in  normal  mammalian 
development,  as  well  as  in  abnormal  events  leading  to  tumor- 
igenesis. 
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Histone  acetyltransferases  (HATs)  are  important  in  transcriptional  regulation,  but  little  is 
known  about  the  role  of  these  enzymes  in  developmental  processes.  Two  mouse  HATs,  Gcn5 
(Gcn512)  and  Pcaf,  share  similar  sequences  and  enzymatic  activities*.  Both  interact  with 
p300/CBP  (Ep300/Crebbp),  two  other  HATs  that  integrate  multiple  signaling  pathways*.  Pcaf  is 

thought  to  participate  in  many  of  the  cellular  processes  regulated  by  p300/CBP^-^,  but  Gcn5 
functions  are  relatively  understudied  in  mammalian  cells.  Here  we  show  that  Pcaf  is  dispensable  in 
mice.  In  contrast,  GcnS  null  embryos  die  during  embryogenesis.  These  embryos  develop  normally 
to  7.5  dpc  but  are  severely  growth  retarded  by  8.5  dpc  and  fail  to  form  dorsal  mesoderm  lineages, 
including  chordamesoderm  and  paraxial  mesoderm.  Differentiation  of  extraembryonic  and  cardiac 
mesoderm  appears  unaffected.  Strikingly,  loss  of  the  dorsal  mesoderm  lineages  is  due  to  a  high 
incidence  of  apoptosis  in  the  GcnS  mutants  that  begins  before  the  onset  of  morphological 
abnormality.  Embryos  null  for  both  GcnS  and  Pcaf  show  even  more  severe  defects,  indicating  that 
these  HATs  have  overlapping  functions  during  embryogenesis.  Our  studies  are  the  first  to 
demonstrate  that  specific  acetyltransferases  are  required  for  cell  survival  and  mesoderm  formation 
during  mammalian  development. 

Mouse  GcnS  and  Pcaf  are  expressed  ubiquitously,  but  in  complementary  amounts,  in  different 

adult  mouse  tissues*.  During  embryogenesis,  Gcn5  is  expressed  uniformly  throughout  the  embryo  from 
7.5-9.0  days  post  coitum  (dpc),  except  in  the  distal  allantois  and  developing  heart  (Fig.  lc).  Gcn5 

expression  is  down-regulated  after  16.5  dpc  (Fig.  la)  but  is  later  up-regulated  in  specific  adult  tissues*. 
In  contrast,  Pcaf  expression  is  low  during  embryogenesis  and  becomes  up-regulated  in  some  adult  tissues 
including  heart  and  skeletal  muscle  (Fig.  16). 

To  decipher  the  roles  of  these  enzymes  in  mammalian  development,  we  generated  mice  lacking 
Pcaf  or  GcnS  (Figs.  2a,  b,  and  3a-c).  Mice  null  for  Pcaf  are  viable  and  fertile  with  no  obvious  abnormal 
phenotype.  In  addition,  we  observed  no  apparent  up-regulation  of  either  Gcn5  mRNA  (Fig.  2c)  or  protein 
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levels  (data  not  shown).  Mice  heterozygous  for  the  Gcn5  null  allele  were  obtained  at  the  expected 
frequency  and  appeared  normal.  Gcn5  null  mice  appeared  normal  and  exhibited  Mendelian  segregation 
at  7.5  dpc  (data  not  shown).  However  by  8.5  dpc,  Gcn5  null  embryos  were  growth  retarded  and  died  by 
10.5  dpc.  Histological  analysis  indicated  that  Gcn5  null  embryos  lacked  specific  mesodermal  derived 
structures,  such  as  somites,  and  rarely  formed  a  recognizable  notochord  or  neural  tube  (Fig.  4a-/).  An 
ectopic  structure  in  the  exocoelomic  cavity  projects  from  the  anterior  end  of  null  embryos  (Fig.  2d,  4m- 
u).  Histologically  this  structure  resembles  ectodermal  cells  (Fig.  4g-z),  but  it  does  not  express  neural 
(Otx2,  Enl,  and  Hes5\  data  not  shown)  or  mesodermal  lineage  markers  (see  below). 

To  determine  which  mesodermal  derivatives  are  missing  in  the  Gcn5  null  embryos,  we  examined 

the  expression  of  mesoderm  lineage  markers  (Fig.  4 m-u).  rfi(Brachyutry)  and  Fg/8^  were  detected  in  the 
mutant  embryos  at  8.25  dpc  (Fig.  4m-ri)  indicating  that  the  primitive  streak  forms.  However,  we  observed 

deficits  in  the  expression  of  Tcfl5^ !( paraxis )  and  Z>///12  ( Delta-like  1  homolog)  in  8.5  dpc  mutants  (Fig. 

* 

4o,  q),  indicating  that  paraxial  mesoderm  is  disrupted.  Similarly,  no  expression  of  Shh^  ( Sonic 
hedgehog)  or  Foxa2^^{Hnf-3b)  was  observed  along  the  anterior  midline  of  8.5  dpc  Gcn5  null 
embryos  (Fig.  4 r,  s)  indicating  defective  notochord  formation.  Foxa2  expression  was  detected  in  the 
distal  tip  and  the  anterior  lateral  area  of  the  mutant  embryos,  in  the  definitive  endoderm  and  node.  Twist 
expression  was  detected  in  the  allantois  of  the  Gcn5  mutants  but  was  barely  detectable  in  embryonic 

tissues  (Fig.  At),  suggesting  that  head  mesenchyme  formation  is  compromised1*1’17.  Mef2c,  an  early 
marker  for  cardiac  mesoderm1^,  was  expressed  normally  in  the  Gcn5  mutants  at  8.0  (Fig.  4m).  This  result 
is  consistent  with  our  observation  that  the  Gcn5  mutant  embryos  have  heartbeats  at  10.5  dpc  and  suggests 
that  cardiac  mesoderm  formation  is  unaffected  in  these  mutants.  Together,  these  results  indicate  that 
mesoderm  is  formed  in  the  Gcn5  null  embryos,  but  paraxial  mesoderm,  chordamesoderm  and  head 
mesenchyme  do  not  develop  normally. 
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The  lineage-specific  defects  in  mesoderm  formation  might  arise  from  a  lack  of  initial  mesoderm 
specification.  Tcfl5  is  expressed  as  early  as  7.5  dpc  in  wild  type  embryos  so  we  re-examined  Tcfl5 
expression  at  this  stage  in  the  mutant.  Tcfl5  expression  was  observed  between  7.5-8. 0  dpc  (Fig.  4 p), 
indicating  that  paraxial  mesoderm  is  specified  but  not  maintained  in  the  absence  of  Gcn5.  Similarly,  Shh 
was  expressed  at  7.5  dpc,  indicating  specification  and  subsequent  loss  of  chordamesoderm  (data  not 
shown). 

Loss  of  mesodermal  derivatives  could  be  due  to  a  failure  of  proliferation  or  to  increased  cell 
death.  To  distinguish  between  these  two  possibilities,  we  examined  Gcn5  mutants  for  expression  of 
Proliferating  Cell  Nuclear  Antigen  (PCNA)  and  by  TUNEL  assay.  We  detected  equal  or  higher  amounts 
of  PCNA  in  Gcn5  mutants  at  8.5  dpc  compared  to  wild  type  embryos  at  7.5  or  8.5  dpc,  indicating  that 
cell  proliferation  is  not  affected  by  Gcn5  loss  (data  not  shown).  However,  TUNEL  assays  reveal  a 
strikingly  high  degree  of  apoptosis  occurs  in  the  mutants  (Fig.  5 c-d)  at  7.5  dpc.  Apoptotic  cells  are  rarely 
found  in  7.5  dpc  wild  type  embryos  (Fig.  5 a-b).  At  8.5  dpc,  more  than  25%  of  cells  (Fig.  5f)  are 
undergoing  apoptosis  in  the  mutants,  whereas  only  about  1%  of  cells,  mostly  clustered  in  the  midline  of 
the  neural  tube,  are  apoptotic  in  wild  type  embryos  (Fig.  5e).  Increased  cell  death  in  the  mutants  is 
restricted  to  the  embryonic  ectoderm  and  mesoderm  (Fig.  5c,  d  and  g).  Since  it  occurs  before 
morphological  defects  are  apparent,  this  increased  cell  death  is  likely  responsible  for  the  disappearance 
of  dorsal  mesoderm  lineages  in  Gcn5  mutants.  Importantly,  the  increase  in  apoptosis  does  not  reflect  a 
general  failure  in  transcription  since  Tcfl5  and  other  markers  are  expressed  at  7.5  dpc  in  the  Gcn5 
mutants. 

Lastly,  we  generated  mice  harboring  compound  mutations  for  Gcn5  and  Pcaf.  Mice 
heterozygous  for  Gcn5  and  null  for  Pcaf  are  morphologically  normal.  However,  mice  null  for  both 
HATs  die  around  7.5  dpc,  several  days  earlier  than  the  Gcn5  mutants.  Double  mutants  are  severely 
retarded,  and  some  yolk  sacs  are  filled  with  blood  (Fig.  6a).  These  embryos  are  observed  at  Mendelian 
ratios  at  7.5  dpc,  but  are  no  longer  found  after  8.5  dpc.  At  6.5  dpc,  double  null  embryos  are 
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developmentally  retarded  and  appear  arrested  at  the  egg  cylinder  stage  with  severely  disorganized 
embryonic  ectoderm  and  visceral  endoderm  (Fig.  6 b-c).  Since  the  combined  loss  of  Pcaf  and  Gcn5  leads 
to  more  severe  defects  than  observed  for  loss  of  Gcn5  alone,  Gcn5  and  Pcaf  must  share  overlapping 
functions  prior  to  6.5  dpc. 

Our  studies  yield  several  important  conclusions.  First,  although  Pcaf  physically  interacts  with 
CBP  and  p300^6,  the  lack  of  a  phenotype  for  the  Pcaf  mutant  mice  demonstrates  that  Pcaf  is  not 

absolutely  required  for  p300  or  CBP  functions^.  Second,  the  absence  of  Gcn5  results  in  loss  of  paraxial 
mesoderm,  chordamesoderm,  and  head  mesenchyme  due  to  a  dramatic  increase  in  apoptosis.  Lastly, 

although  Gcn5  also  physically  interacts  with  P300/CBP1,  the  phenotypes  of  mice  lacking  p300  are 

substantially  different^  from  those  lacking  Gcn5,  suggesting  that  Gcn5  functions  are  unique  and  are  not 
merely  supplementary  to  those  of  p300/CBP.  Rather,  these  different  phenotypes  indicate  that  individual 
HATs  are  required  as  cofactors  for  specific  deyelopmental  processes. 


Methods 

Pcaf  targeting.  We  isolated  a  portion  of  the  Pcaf  gene  by  screening  a  mouse  genomic  library  (Lambda 
FDOI,  Stratagene).  One  clone  (19.5  kb)  consists  of  two  exons,  which  are  127  bp  and  146  bp  in  length  and 
are  located  249  and  376  bp  downstream  of  the  translation  start  site  in  the  Pcaf  coding  sequence.  The 
latter  exon  was  targeted.  An  SV40  poly-adenylation  sequence  was  inserted  into  the  targeting  vector 
between  the  5’  and  3’  homology  arms.  Upon  homologous  recombination,  the  targeted  exon  was  replaced 
by  the  poly-adenylation  sequence  and  the  PGK  Neo  cassette  which  would  prematurely  terminate  the  Pcaf 
transcript.  The  length  of  the  targeted  exon  is  not  a  multiple  of  three  base  pairs,  so  in  the  event  that 
splicing  skips  the  targeted  exon,  the  resulted  cDNA  would  be  out  of  frame  for  the  Pcaf  protein.  For 
genotyping,  genomic  DNA  was  digested  with  Pstl  or  Xbal,  and  probed  with  a  1.3  kb  Pstl-Xbal  fragment 
(5’  probe)  or  a  0.65  kb  BamHL  fragment  (3’  probe)  respectively  (Fig.  2). 
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GcnS  targeting.  Gcn5  genomic  sequence  that  covers  the  entire  Gcn5  coding  region  was  isolated 

previously^.  A  1.5  kb  genomic  fragment  upstream  of  the  translation  start  site  was  used  as  5'  homology 
arm.  A  5.5  kb  fragment  was  excised  by  Xbal  and  Noil  to  generate  the  3’  homology  arm.  In  addition,  a 
LacZ  gene  was  inserted  into  the  Gcn5  locus,  where  its  transcription  was  to  be  driven  by  the  Gcn5 
regulatory  sequences  (Fig.  3).  Homologous  recombination  replaced  the  Gcn5  coding  region  with  the 
LacZ  gene  and  the  PGK-Neo  cassette.  For  genotyping,  genomic  DNA  was  digested  with  EcoRN  and 
probed  with  a  0.6  kb  Apal  fragment  5'  to  the  5'  homology  arm.  Similarly,  genomic  DNA  was  digested 
with  EcoRN  and  probed  with  a  0.5  kb  PCR  generated  fragment  3'  to  the  3'  homology  arm.  PCR  primers 
used  to  generate  the  3'  probe  are:  5’-  AGTAGGTGGTATGGCTTCTCAG;  3’-  CTAGAAGGCTCAGGA 
TACCATC. 

Creation  of  Pcaf  or  GcnS  null  mice.  GcnS  and  Pcaf  targeting  vectors  were  linearized  and  electroporated 
separately  into  129/SvEv  derived  ES  cells  AB1,  originally  generated  by  Dr.  A  Bradley  (Baylor  College 
of  Medicine).  G418  and  FIAU  double  resistant  cells  were  genotyped  by  Southern  blot  analyses  using 
Gcn5  or  Pcaf  probes  described  above.  Correctly  targeted  ES  cells  were  injected  into  3.5  dpc  C57BL/6J 
(Jackson  Lab)  blastocysts.  Male  chimeras  were  mated  with  wildtype  C57BL/6J  females.  Germline 
transmission  of  the  mutant  allele  was  determined  by  Southern  blot  analysis  of  tail  genomic  DNA  using 
the  probes  described  above.  Three  out  of  four  independent  Pcaf  targeted  ES  cell  lines  and  two  out  of 
three  independent  GcnS  ES  cell  lines  gave  rise  to  germline  transmission.  Heterozygous  mice  were 
intercrossed  to  generate  homozygous  null  mice.  Embryos  younger  than  8.5  dpc  were  genotyped  by  PCR. 
Primers  for  Gcn5 :  5’  -  AACCTGACCCTGGAGGATGCCAAG;  3’ 

GTGACGGCAC AGAAGACGATTTCC .  Primers  for  Pcaf.  5’  -  AAGCTGCTCACGTTTCTCACTT 
GG;  3'-GGGTAAGCTCTCACCTTGAATAGG.  Primers  iotNeo:  5'  -  TCCTGCCGAGAAAGTATCC 
AT;  3’  -  AGGAAAGGACAGTGGGAGTGG.  Primers  for  LacZ;  5'  -  GCATCGAGCTGGGTAATAA 
GGGTTGGCAAT;  3'  -  GAC ACCAGACCAACTGGTAATGGTAGCGAC . 
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Generation  of  Pcaf  and  Gcn5  double  mutants.  Gcn5  heterozygous  mice  were  crossed  with  Pcaf 
heterozygotes  to  create  mice  doubly  heterozygous  for  Gcn5  and  Pcaf.  Double  heterozygotes  were  mated 
with  Pcaf  null  mice  to  produce  mice  that  were  heterozygous  for  Gcn5  and  null  for  Pcaf.  Mice 
heterozygous  for  Gcn5  and  null  for  Pcaf  were  normal  and  intercrossed  to  generate  mice  that  were  doubly 
null  for  both  genes. 

RNA  analysis.  Total  RNA  from  whole  embryos  or  adult  mouse  tissues  was  isolated  and  electrophoresed 
on  1.5%  agarose  gels  containing  formaldehyde.  RNA  was  transferred  to  GeneScreen  Plus  membrane 

(NEN  Life  Science)  and  hybridized  with  Gcn5  and  Pcaf  cDNA  probes  as  previously  described  1. 
Histological  studies.  Mouse  embryos  were  fixed  with  4%  paraformaldehyde  overnight  at  4  °C,  and 
dehydrated  with  graded  alcohol.  Embryos  were  cleared  in  xylene,  incubated  in  1:1  xylene/paraffin  for  45 
min  at  65  °C,  washed  3X  with  paraffin  30  min  each  at  65  °C,  and  embedded  in  paraffin.  Sections  were 
made  at  7  pm  and  stained  with  hematoxylin  and  eosin  according  to  standard  procedure. 

Whole-mount  in  situ  hybridization.  Whole  mount  in  site  hybridization  of  embryos  was  performed  as 
described  by  Hogan  et  al.  1994^0.  Three  or  more  embryos  were  used  for  each  marker.  The  cDNAs  used 
to  generate  probes  used  have  been  previously  described^’  10,13-17  The  cDNAs  used  to  generate  probes 
for  Tcfl5 1 1  (paraxis )  and  Mef2c ^  were  kindly  provided  by  Dr.  E.  Olson. 

TUNEL  assay.  TUNEL  assay  was  performed  on  paraffin  sections  using  the  ApopTag  Plus  Peroxidase  In 
situ  Apoptosis  Detection  Kit  purchased  from  Intergen  according  to  the  manufacturer’s  protocol. 
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Figure  Legends 

Fig.  1  Differential  expression  of  Gcn5  and  Pcaf  in  mouse  embryos,  a-b,  total  RNA  was  isolated  from 
mouse  embryos  at  the  stages  indicated.  Northern  blot  hybridization  was  carried  out  using  either  a,  Gcn5 
or  b,  Pcaf  specific  probes.  Rps26  (ribosomal  protein  S26)  sequences  were  used  as  an  internal  control,  c, 
whole-mount  in  situ  hybridization  using  a  Gcn5  specific  probe.  Arrows  indicate  distal  allantois  (8.5  dpc 
embryo)  and  the  heart  (9.0  dpc  embryo),  dpc:  days  post  coitum. 

Fig.  2  Disruption  of  the  Pcaf  gene,  a,  targeting  strategy  for  the  Pcaf  gene.  A  5’  exon  (shown  in  black)  in 
the  Pcaf  gene  was  deleted  by  homologous  recombination  and  replaced  with  a  SV40  polyadenylation 
cassette  and  the  phosphoglycerate  kinase  ( PGK)-Neo  gene.  The  size  and  position  of  the  targeted  exon  is 
indicated.  Positions  of  the  5’  and  3’  probes  for  Southern  blot  analyses  are  shown.  Arrows  mark  the 
orientation  of  the  thymidine  kinase  (TK),  PGK-Neo,  or  PolyA  cassettes.  HAT:  acetyltransferase  catalytic 
domain.  Bromo:  the  bromodomain.  P:  Pstl;  X:  Xbal.  b,  genotyping  of  Pcaf  targeted  mice  by  Southern 
blot  hybridization.  Mouse  tail  genomic  DNA  isolated  from  wild  type,  Pcaf  heterozygous  and 
homozygous  mutant  mice  were  digested  with  Xbal  and  hybridized  with  3’  probe  shown  in  panel  a.  The 
genotypes  of  the  mice  are  indicated  on  top  of  the  panel.  The  sizes  of  the  wild  type  and  mutant  bands  are 
indicated,  c.  Total  RNA  was  isolated  from  tissues  of  wild  type  and  Pcaf  null  mice.  Northern  blot 
hybridization  was  performed  using  a  Pcaf  probe  that  is  3’  to  the  targeted  exon.  The  same  blot  was 
subsequently  re-probed  with  a  Gcn5  specific  probe  and  an  internal  control  Rps26  probe.  Arrows  indicate 
the  positions  of  Pcaf  Gcn5,  and  Rps26  signals. 

Fig.  3  Gcn5  null  mice  die  during  embryogenesis.  a,  targeting  strategy  for  disruption  of  the  mouse  Gcn5 
gene.  By  homologous  recombination,  the  entire  Gcn5  coding  region  was  removed  and  replaced  with  a 
promoterless  LacZ  gene,  which  is  placed  under  the  control  of  the  Gcn5  promoter.  ATG  marks  the 
translation  start  site  of  the  Gcn5  coding  sequence.  Arrows  indicate  the  orientation  of  the  LacZ  and  PGK- 
Neo  genes.  RV:  EcoRY.  b,  genotyping  of  Gcn5  targeted  mice  by  Southern  blot  hybridization.  Genomic 
DNA  was  isolated  from  wild  type  and  Gcn5  heterozygous  mutant  mice  as  indicated  and  digested  with 
EcoRY.  Southern  blot  hybridization  was  carried  out  using  the  5’  probe  shown  in  panel  a.  The  size  and 
position  of  the  wild  type  and  mutant  bands  are  indicated  by  arrows,  c,  whole-mount  in  situ  hybridization 
of  a  wild  type  and  Gcn5  null  embryos  using  a  Gcn5  riboprobe.  Gcn5  is  widely  expressed  in  a  7.5  dpc 
wild  type  embryo,  but  not  in  a  Gcn5  null  littermate.  d,  morphology  of  Gcn5  null  embryos  at  7.5,  8.5  and 
9.0  dpc.  Genotypes  of  the  Gcn5  alleles  are  indicated  below  the  embryos.  Mice  derived  from  two 
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independent  ES  cell  lines  showed  identical  phenotypes.  The  severity  of  the  mutant  phenotype  was 
slightly  variable,  es:  ectopic  structure;  al:  allantois. 

Fig.  4  Histologic  and  marker  analyses  of  Gcn5  mutant  embryos.  4a-/,  Paraffin  sections  were  prepared 
from  Gcn5  mutant  embryos  and  control  littermates,  and  stained  with  haematoxylin-eosin.  a-f,  transverse 
sections  (proximal  to  distal)  of  a  8.5  dpc  wild  type  embryo,  g-l,  transverse  sections  (proximal  to  distal)  of 
a  8.5  dpc  Gcn5  null  embryo,  m-u,  marker  gene  expression  in  Gcn5  null  embryos.  Whole-mount  in  situ 
hybridization  was  performed  to  examine  the  expression  of  marker  genes.  In  all  panels  the  embryo  on  the 
left  is  wild  type,  and  the  one  on  the  right  is  Gcn5  null.  Marker  genes  are  indicated  at  the  upper-left  comer 
of  each  panel.  Orientation  of  the  embryos  within  one  panel  is  consistent,  but  may  vary  in  different 
panels,  m,  T  (Brachyury)  is  detected  at  the  posterior  end  of  a  8.25  dpc  mutant  in  a  pattern  similar  to  that 
of  the  wild  type  littermate,  but  at  reduced  levels,  n,  Fgf8  is  expressed  at  the  posterior  end  of  a  8.25  dpc 
mutant  in  a  pattern  similar  to  that  of  the  wild  type  littermate.  o,  Tcfl5  {paraxis )  is  expressed  in  the 
presomitic  mesoderm  and  somites  of  a  8.5  dpc  wild  type  embryo,  but  its  expression  is  absent  in  a  Gcn5 
mutant  littermate.  p,  Tcfl5  is  expressed  in  the  paraxial  mesoderm  of  a  7.5  dpc  Gcn5  null  embryo, 
although  its  expression  is  reduced  compared  to  a  wild  type  littermate.  q.  Dill  is  expressed  in  the 
presomitic  mesoderm  of  an  8.5  dpc  wild  type  embryo,  but  not  in  a  Gcn5  mutant  littermate.  r,  Shh  ( Sonic 
hedgehog)  is  expressed  in  the  notochord  and  floor  plate  of  a  8.5  dpc  wild  type  embryo,  but  not  in  the 
Gcn5  mutant  littermate.  s,  Foxa2  ( Hnf-3b )  expression  is  detected  in  the  notochord  of  an  8.5  dpc  wild 
type  embryo.  Foxa2  expression  is  detected  at  the  distal  tip  and  ventral  lateral  region  of  the  mutant 
littermate,  but  not  in  the  anterior  midline  region,  t.  Twist  is  expressed  in  the  head  mesenchyme,  somites 
and  allantois  of  an  8.5  dpc  wild  type  embryo.  Twist  is  expressed  in  the  allantois  of  the  mutant  littermate, 
but  only  very  weak  signal  is  detected  in  the  ectopic  structure  and  the  dorsal  half  of  the  mutant  littermate. 
u,  Mef2c  is  expressed  in  the  primitive  cardiac  mesoderm  of  an  8.0  dpc  wild  type  and  its  mutant 
littermate.  al:  allantois;  am:  amnion;  cm:  cardiac  mesoderm;  ch:  chorion;  es:  ectopic  structure;  hf:  head 
fold;  hm:  head  mesoderm;  he:  heart;  gd:  gut  diverticulum;  me:  mesoderm;  ne:  neural  epithelium;  ntc: 
notochord;  ps:  primitive  streak;  psm:  presomitic  mesoderm;  so:  somites. 


Fig.  5  Apoptosis  is  significantly  increased  in  Gcn5  null  embryos.  Paraffin  sections  were  prepared  from 
7.5  dpc  wild  type  embryos  in  both  (a,  al)  sagittal  and  ( b )  transverse  orientation,  c  and  d,  7.5  dpc  Gcn5 
null  embryos  in  transverse  orientation,  e,  el  and  e2,  8.5  dpc  wild  type  embryos  in  transverse  orientation. 
f,fl,  8.5  dpc  Gcn5  mutants  with  both  frontal  and  g,  transverse  views.  Extraembryonic  tissues  of  7.5  dpc 
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embryos  from  heterozygous  matings  were  isolated  for  genotyping,  and  the  remaining  embryonic  portions 
of  the  embryos  were  processed  for  TUNEL  assays.  Apoptotic  cells  are  brown.  Boxed  areas  in  panels  a,  e, 
and / are  magnified  in  panels  al,  el,  e2,  and  fl.  ec:  ectoderm;  me:  mesoderm;  en:  endoderm. 


Fig.  6  Mice  null  for  both  Pcaf  and  Gcn5  die  around  7.5  dpc.  a,  photomicrograph  of  a  freshly  dissected 
7.75  dpc  litter  from  a  Pcaf~!~  Gcn5+/~  intercross  shows  two  double  null  embryos  (arrows)  with 
littermates.  b  and  c,  histological  analysis  of  Pcaf  Gcn5  double  null  embryos.  Paraffin  sections  were 
prepared  from  double  mutant  embryos  and  control  littermates,  and  stained  with  haematoxylin-eosin.  b. 
Transverse  sections  of  6.5  dpc  Pcaf  null  Gcn5  wild  type  and  c,  double  null  littermates  Genotyping  on 
serial  sections  was  performed  using  radioactive  section  in  situ,  rm:  Reichert’s  membrane;  ee:  embryonic 
ectoderm;  pc:  proamniotic  canal;  ve:  visceral  endoderm;  ysc:  yolk  sac  cavity. 
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